y<  i 

Q 

< 


DEVELOPMENT  OF  SIMULATED  DIRECTIONAL  AUDIO 
FOR  COCKPIT  APPLICATIONS  <U) 


THEODORE  J.  DOLL 
JEFFREYM.  GERTH 
WILLIAM  R.  ENGELMAN 
DENNIS  J.  FOODS 

SYSTEMS  ENGINEERING  LABORATORY 
GEORGIA  TECH  RESEARCH  INSTITUTE 
GEORGIA  INSTITUTE  OF  TECHNOLOGY 


JANUARY  1936 


Approved  for  Public  Release;  Distribution  is  Unlimited 


HARRY  G.  ARMSTRONG  AEROSPA  CE  MEDICAL  RESEARCH  LABORATORY 

AEROSPACE  MEDICAL  DIVISION 

AIR  FORCE  SYSTEMS  COMMAND 

WRIGHT- PATTERSON  AIR  FORCE  BASE,  OHIO  45433-6573 


H  *  *  *  *  *.*  -  *\*  -\  *  «»•*  U  *  *1  +  m  .  -  u  ■  4  \  -T.  ».  ■  ^  4% 


\%\%  V  V 


NOTICES 


When  US  Government  drawings,  specifications  or  other  data  are  used  for  any 
purpose  other  than  a  definitely  related  Government  procurement  operation, 
the  Government  thereby  incurs  no  responsibility  nor  any  obligation 
whatsoever,  and  the  fact  that  the  Government  may  have  formulated,  furnished, 
or  In  any  way  supplied  the  said  drawings,  specifications,  or  other  data,  is 
not  to  be  regarded  by  implication  or  otherwise,  as  In  any  manner  licensing 
the  holder  or  any  other  person  or  corporation,  or  conveying  any  rights  or 
permission  to  manufacture,  use,  or  sell  any  patented  invention  that  may  In 
any  way  be  related  thereto. 


Please  do  not  request  copies  of  this  report  from  Armstrong  Aerospace  Medical 
Research  Laboratory.  Additional  copies  may  be  purchased  from: 

National  Technical  Information  Service 
5285  Port  Royal  Road 
Springfield,  Virginia  221G1 


Federal  Government  agencies  and  their  contractors  registered  with  Defense 
Technical  Information  Center  should  direct  requests  for  copies  of  this  report 
to: 


Defense  Technical  Information  Center 

Cameron  Station 

Alexandria,  Virginia  22314 


TECHNICAL  REVIEW  AND  APPROVAL 

AAMRL-TR-86-014 


This  report  has  been  reviewed  by  the  Office  of  Public  Affairs  (PA)  and  is 
releasable  to  the  National  Technical  Information  Service  (NTIS).  At  NTIS  it 
will  be  available  to  the  general  public,  including  foreign  nations. 

This  technical  report  has  been  reviewed  and  is  approved  for  publication. 

FOR  THE  COMMANDER 


Director,  Human  Engineering  Division 
Armstrong  Aerospace  Medical  Research  Laboratory 


UNCLASSIFIED 


mcunitv  classic ic a t ion  op  this  rape 


REPORT  DOCUMENTATION  PAQE 


U,  MPOUTMCUHITV  CLASSIFICATION 

Unclassified 


lb.  RBITRICTIV9  MARKINOS 


a*.  S1CURITY  CLASSIFICATION  AUTHORITY 


3b.  OICLABSlRlCATION/OOWNORAOINO  SCHSOUUI 


3.  OISTRIBUTION/A  VAILASI LITV  OR  RSRORT 

Approved  for  public  release;  distribution  is 
unlimited. 


4  RSRRORMINO  ORGANISATION  RSRORT  NUMSSRISI 


S,  MONITORING  ORGANIZATION  RSRORT  NUMSSRISI 

AAMRL-TR- 86-014 


kb.  ORRICS  SYMBOL 
(KmUfki»\ 


Sa  NAMS  OR  RIRRORMINO  ORGANIZATION 

Georgia  Institute  of  Technology 
Georgia  Tech  Research  Institute 


7*.  NAMS  OR  MONITORING  ORGANIZATION 

AAMRL/HEA 


•«.  AOORSW  (Cily,  Hi* It  and  Xlf  Cod*) 

Atlanta,  GA  30132 


7b.  AOORStS  (City,  Hiatt  and  ZIP  Cod*) 

Wright-Patterson  AFB  OH  45433-6573 


Sa  NAMS  OR  RUNOING/SRONSORINO 
ORGANIZATION 


.  ORRICS  SYMBOL 
llf*H0lk  dbla) 


9.  RROCURIMSNT  IN9TRUMSNT  IOSNTIRICATION  NUMSSR 

Subcontract  1054-9A  in  support  of  prims 
contract  F33615-82-C-0513 


•c.  AOORT  AS  icily,  SHI*  and  2/R  Cod*) 

10.  SOURGI  OR  RUb 

ICING  NOS.  1 

RROGRAM 
SLIMSNT  NO, 

6.1101F 

RROJBCT 

NO. 

7184 

TASK 

NO. 

26 

WORK  UNIT 
NO. 

D2 

11,  TITLS  llntlud*  Ueurtty  Claulfltallon) 

SEE  REVERSE 

ia,  RSRSONAL  AUTHORIS) 

Poll.  Theodore  .T..  flerth.  Jeffrey  M. .  Knselmnn,  William  R.  nnd  Folds^  Dennis  Jj 


13*.  TYPI  OP  AMORT 

Technical 


13b.  TIMS  QOVIRSO 
RROM  Nov  RL  Tfl  .tAn  RA 


14.  OATS  OR  RIRORT  l»„  Mo.,  O* V) 

1986  January 


II,  RAGS  COUNT 

230 


1b,  SURRLIMSNTARY  NOTATION 


1 17.  COSATI  COOSS 

_  RISLO 

OROUR 

•us.  as. 

IS  SUBJECT  TIRMS  IConllnut  on  raaana  if  ntttuary  and  Idanttfy  by  bio *k  numbtrj 

Auditory  Localization  Head-Coupled  Control/Disolay 

Directional  Cues  Systems 

liiaaiUasgaiUilid  PmtlBf _ Three-Dimensional  Sound 


11.  ABITRACT  IContlnu*  on  ravtrt*  If  ntttuary  and  Idanllfy  by  Htoek  numbar) 

— ibThe  long-term  objective  of  this  work  is  to  develop  techniques  for  conveying  accurate 
spatial  information  via  audio  signals  delivered  to  the  listener  through  headphones  or  ear¬ 
phones.  This  project  included  three  major  activities:  (1)  an  extensive  review  and 
synthesis  of  the  research  literature  on  auditory  localization,  (2)  the  design,  fabrication, 
and  evaluation  of  an  armaratus  for  demonstrating  simulated  auditory  localization  (SAL),  and 
(3)  experimental  research  to  determine  characteristics  of  the  audio  signal ,  in  the  time 
and  frequency  domains,  which  enhance  localization  performance  with  simulated  cues. 

Previous  research  is  reviewed  which  describes  the  cues  involved  in  the  perception  of 
sound-source  direction,  both  horizontally  and  vertically,  when  the  head  is  stationary. 

Also  reviewed  is  research  on  auditory  distance  perception,  the  roles  of  head  movement  and 
vision  in  auditory  localization,  the  perception  of  auditory  motion  nnd  volume,  and  the  A 
effects  of  noise  on  auditory  localization.  A  feedback  control  model  (continued  on  back)  ^ 


/ 


»0.  OISTRI  BUTION/AVAI  LABILITY  OR  ABSTRACT 
UNCLASSIRIEO/UN  LIMIT!  D  □  SAMS  AS  RRT.  &  OTIC  USIRS  □ 


31.  ABSTRACT  SECURITY  CLASSIR ICATION 

UNCLASSIFIED 


331.  NAMS  OR  RIIRONSIbLI  INOIVIOUAL 

Gloria  L.  Calhoun 


33b  TBLBRHONR  NUMBER 
l/ncJuU*  .In'rt  ('otitfj 

(513)  255-7595 


DD  FORM  1473,  83  APR 


33c.  ORRICS  SYMBOL 

AAMRL/HEA 


COITION  OR  1  JAN  1 3  IS  OBSOLETE. 


UNCLASSIFIED 


SECURITY  CLASSIR ICATIOR!  OR  THIS  RAdi 


UNCLASSIFIED 


•CCuaiTV  CLASSIFICATION  OF  THIS  FAO( 


11.  TITLE:  Development  of  Simulated  Directional  Audio  for  Cockpit  Applications  (ll) 

20.  ABSTRACT  -  continued 

^  is  presented,  which  integrates  evidence  derived  from  four  different  theoretical  positions 
concerning  the  effects  of  head  movement  end  vision  on  auditory  localization.  Possible 
applications  of  SAL  technology  In  aircraft  cockpits  are  outlined,  and  the  potential 
benefits  of  such  applications  are  discussed.  Topic  areas  in  auditory  localization  are 
identified  where  further  basic  research  is  needed  to  support  the  development  of  SAL  into 
a  valuable  pilot  aid. 


The  design,  evaluation,  and  refinement  of  the  SAL  facility  is  described.  An 
experimental  test  of  the  psychological  fidelity  of  the  SAL  facility  is  summarized.  The 
results  ahow  that  the  facility  produces  a  high-fidelity  simulation  of  normal,  unaided 
auditory  localization. 

Two  experiments  are  described  in  which  listeners  localized  sounds  on  the  basis  of 
simulated  cues  delivered  dlchotlcally.  Localization  accuracy  and  response  time  were 
compared  for:  (1)  nine  different  filtered  noise  stimuli,  designed  to  make  available  some 
localization  cues  while  eliminating  others,  and  (2)  16  different  intermittent  noise 
stimuli,  formed  by  manlpulstlng  burst  duration,  duty  cycle,  rise  time,  and  the  frequency 
composition  of  the  noise.  The  stimulus  characteristics  had  relatively  little  effect  on 
localization  speed  and  accuracy.  It  la  concluded  that,  when  the  listener's  head  is  free 
to  move,  localization  performance  with  simulated  cues  is  relatively  insensitive  to  the 
character  of  the  audio  signal. 

A  long-term  research  and  development  plan  for  an  electronic  simulator  of  auditory 
localization  cues  is  presented.  The  simulator  would  be  suitable  for  airborne  applications, 
have  the  capability  to  impress  directional  qualities  on  incoming  signals  in  real  time, 
and  be  able  to  compensate  for  head  movements  so  as  to  make  directional  signals  appear 
to  be  stationary  in  apace.  Alternative  approaches  are  discussed  and  major  technical 
Issues  are  outlined. 


UNCLASSIFIED 


■aCUSITV  CLASSIFICATION  OF  THIS  FAOI 


PREFACE 


The  research  described  In  this  report  was  performed  by  the  Systems 
Engineering  Laboratory  of  the  Georgia  Tech  Research  Institute  (GTRI)  of  the 
Georgia  Institute  of  Technology.  The  project  was  sponsored  by  the  U.S.  Air 
Force  Armstrong  Aerospace  Medical  Research  Laboratory  (AAMRL)  under 
subcontract  number  1054--9A  with  MacAulay-Brown,  Inc.,  In  support  of  prime 
contract  number  F33615-82-C-0513. 

The  principal  investigator  and  project  director  for  this  was  Dr,  Theodore 
J.  Doll  of  GTRI.  The  Air  Force  project  technical  monitor  was  Ms.  Gloria 
Calhoun  of  the  AAMRL  Human  Engineering  Division,  Visual  Display  Systems  Branch 
(AAMRL/HEA). 

The  authors  wish  to  thank  the  following  individuals  for  their  help  in 
designing  and  building  parts  of  the  Simulated  Auditory  Localisation  (SAL) 
.'esearch  facility  i  Professor  W.  Marshall  Leach  of  the  Georgia  Tech  School  of 
Electrical  Engineering  and  Mr.  Harold  F.  Engler  and  Mr.  Richard  V.  Morrison  of 
GTRI.  Thanks  are  due  to  Dr.  Thomas  A.  Furness  and  Dr.  Robert  G.  Eggleston  of 
AAMRL/HEA  and  Dr.  Bryce  0.  Hartman  of  the  Air  Force  School  of  Aerospace 
Medicine  for  their  support  of  this  program.  The  authors  also  express  their 
appreciation  to  Mr.  Paul  H.  Cleveland  and  Mr.  J.  Craig  Wyvlll  of  GTRI  for 
loaning  the  project  valuable  pieces  of  test  equipment;  and  to  Knowles 
Electronics  Co.  for  providing  miniature  microphones  free  of  charge. 


TABLE  Of  CONTENTS 


Section 


Page 


1.0  INTRODUCTION 


1.1  Background 


11 


1.2  Objectives  12 

1.3  Summary  end  Conclusions  12 


2.0  LITERATURE  REVIEW  AND  RECOMMENDATIONS  16 

2.1  Introduction  16 

2.2  Localisation  in  the  Horisontal  Plane  16 

2.2.1  Interaural  Time  Differences  26 

2.2.2  Interaural  Amplitude  Differences  36 

2.2.3  Monaural  Cues  36 

2.3  Localisation  in  the  Median  and  Vertical  Planes  43 

2.3.1  Interaural  Differences  44 

2.3.2  Monaural  Cues  47 

2.4  Auditory  Distance  Perception  56 

2.5  Effects  of  Head  Movement  and  Vision  ou  Auditory  58 

Localisation 

2.6  Perception  of  Auditory  Motion  and  Volume  66 

2.7  Effects  of  Noise  on  Auditory  Localisation  69 

2.8  Cockpit  Applications  of  Simulated  Auditory  70 

Localisation  (SAL) 

2.8.1.  Directional  Cueing  71 

2.8.1. 1  Performance  Implications  of  75 

SAL  Directional  Cues 

2.8. 1.2  Other  Issues  Affecting  the  76 

Usefulness  of  SAL  Directional 

Cues 

2.8.2  Enhancement  of  Situational  Awareness  77 

2.8.3  Enhancement  of  Communications  and  78 

Audio  Warnings 

■2.9  Recommendations  for  Further  Research  79 


2 


TABLE  OF  CONTENTS 

(continued) 


Section  Page 

3.0  FACILITY  DESCRIPTION  84 

3.1  Facility  Overview  84 

3.2  Hardware  Design  84 

3.3  Software  Architecture  91 

3.3.1  Major  Component e  91 

3.3.2  Overview  of  Software  from  User's  91 

Perspective 

3.3.3  Real  Tima  Calculation  of  Sound  Fosition  95 

3.3.4  Calculation  of  Perceived  Sound  Direction  98 

3. 3.4.1  Calculation  of  Perceived  Sound  98 

Direction  Baaed  on  Finger  Position 

3. 3.4. 2  Calculation  of  Perceived  Sound  99 

Direction  Based  on  Head  Orientation 

and  Fosition 

3.3.5  Response  Time  Measurement  99 

3.4  Experimental  Rooms  100 

4,0  VALIDATION  OF  THE  EXPERIMENTAL  FACILITY  102 

4.1  Room  Acoustics  Tests  102 

4.2  Post-installation  Tests  of  the  Audio  Production  104 

and  Control  System 

4.2.1  Loudspeaker  Polarity  Check  104 

4.2.2  Amplitude  Response  as  a  Function  of  104 

Frequency 

4.2.3  Equalization  of  the  Loudspeakers  106 

4.2.4  Perceptual  Equivalence  of  Phantom  and  106 

Real  Sources 

4.3  Tests  of  the  Audio  Playback  System  107 

4.4  Real-Time  Control  of  Sound  Direction  111 

4.4.1  Interface  with  the  3-Space  Tracker  System  111 

4.4.2  Determination  of  Relay  and  Attenuator  Values  111 


3 


TABLE  OF  CONTENTS 

(concluded) 


A. 5  Psychological  Fidelity  Experiment  113 

4.5.1  Introduction  113 

4.5.2  Method  114 

4*5.2. 1  Experimental  Design  114 

4.5.2. 2  Procedure  116 

4.5. 2. 3  Subjects  119 

4.5.3  Results  119 

4.5.4  Discussion  and  Conclusions  132 


6.0  LONG-TERM  RESEARCH  AND  DEVELOPMENT  PLAN  167 

6.1  Objectives  167 

6.2  Approach  168 

6.3  Major  Technical  Issues  175 

6.3.1  System  Spatial  Resolution  175 

6.3.2  System  Response  Tims  177 

6.4  Proposed  Research  180 

7.0  APPENDIX:  ANNOTATED  BIBLIOGRAPHY  182 

8.0  REFERENCES  216 


4 


LIST  OF  FIGURES 


Method  used  by  Batteau  at  al.  (1965)  to  simulate 
auditory  localization. 

Average  transformation  of  sound  pressure  level  from 
free  field  to  human  eardrum  as  a  function  of  freouency 
at  24  angles  of  incidence  in  the  horizontal  plane. 

(From  Shew,  1982b.) 

Spherical  head  model,  showing  the  cone  of  confusion 
and  defining  the  spherical  coordinate  system.  (From 
Searle  et  al.,  1976.) 

Minimum  audible  angle  between  successive  pulses  of  tone 
as  a  function  of  the  frequency  of  the  tone  and  the 
direction  of  the  source  (  *  : 0 0 ;  o  <30*;  6:60*;  6:75*). 

(From  Mills,  1972.) 

Differences  between  the  distances  of  the  ears  from  a 
source  of  sound  far  enough  away  to  produce  a  nearly 
plane  wave  front.  (From  Mills,  1972.) 

Interaural  time  delay  for  tones  and  elides  as  a  func¬ 
tion  of  azimuth,  8  .  (From  Durlsch  6  Colburn,  1978.) 

Measured  end  predicted  interaural  time  delays  with  no 
torso  as  a  function  of  angle  of  incidence  '  *  *  measure¬ 
ments  at  360  Hz;  6  6  A  measurements  at  500  Hz;  77V  pre¬ 
dictions  based  on  Kuhn's  (1977)  model;  *  *  *  measurements 
at  3.0  kHz;  •  «  o  predictions  based  on  simple  geometry. 

Front-back  ambiguity  of  interaural  time  and  amplitude 
differences. 

Momentary  sound  pressure  level  at  the  left  (L)  and 
right  (R)  ears. 

Comparison  of  the  interaural  differences  in  phase  and 
intensity  that  can  just  be  detected  when  tone  pulses 
are  presented  through  earphones  with  the  Interaural 
differences  in  phase  and  in  Intensity  that  are  present 
when  an  actual  source  of  tone  pulses  is  moved  just 
noticeably  out  of  the  median  plane.  (From  Mills,  1972.) 

Interaural  Intensity  differences  as  a  function  of  the 
direction  of  the  source  and  the  frequency  of  sound. 

(From  Feddersen  et  al.,  1957.) 

Family  of  23  self-consistent  curves  showing  HRTF's 
normalized  to  0s  azimuth,  (^rom  Shaw,  1974.) 


LIST  OF  FIGURES 

(continued) 


Figure  Page 

13  Localization  of  connected-speech  signals  as  function  45 

of  transverse  to  median  plane  orientation.  (From 

Gardner  and  Gardner,  1973.) 

14  Mean  percent  correct  responses  for  ten  subjects  50 

localizing  low-  and  high-pass-filtered  white  noise 

with  accuracies  of  £  45#  (0)  and  _+  15°  (+).  Dotted 
lines  show  mean  accuracies  for  whits  noise. 

(From  Hebrank  and  Wright,  1974b.) 

15  Relative  frequencies  with  which  listeners  reported  52 

one-third  octave  noise  of  various  center  frequencies 

to  be  located  behind,  above,  and  in  front.  (From 
Blauert,  1969/70.) 

16  Spectral  filtering  created  by  combining  a  signal  with  54 

a  delay  of  itself.  Delays  less  than  30  Msec  cause  low- 

pass  filtering  and  delays  greater  than  30  Usee  cause 
multiple  spectral  notches.  (From  Wright,  Hebrank,  and 
Wilson,  1974.) 

17  Spectral  peaks  and  notches  generated  by  adding  white  55 

noise  to  two  delayed  echos  of  itself,  as  a  function  of 

one  of  the  delays,  Ty.  The  "above"  and  "below"  labels 
are  positioned  over  \  values  for  which  listeners 
reported  the  sound  source  to  be  high  or  low  in  the 
transverse  vertical  plane.  (From  Watkins,  1978.) 

18  Feedback  control  model  of  auditory  localization.  63 

19  Schematic  of  SAL  test  facility.  85 

20  Functional  diagram  of  the  SAL  system.  86 

21  Block  diagram  of  playback  system,  83 

22  Audio  controller  and  power  amplifier.  89 

23  Block  diagram  of  digital  control  logic  and  devices.  92 

24  Coordinate  system  used  in  the  real-time  calculation  96 

of  sound  paoition. 

25  SAL  experimental  room  layout.  101 


6 


LIST  OF  FIGURES 
(concluded) 


Figure 

Page 

26 

Input  signal  (upper  trace)  and  amplitude  response 
(lower  trace)  of  the  audio  production  and  control 
system. 

105 

27 

Amplitude  response  of  the  audio  playback  system. 

110 

28 

Parameters  of  the  Intermittent  noise  stimuli  used 

In  Experiment  II. 

149 

29 

RAM  requirements  for  off-line  pre-recording  approach. 

171 

30 

Conceptual  diagram  of  a  real-time  directional 
synthesizer  showing  the  digital  filtering  operation. 

173 

31 

RAM  requirements  for  real-time  synthesis  approach. 

174 

7 


LIST  OP  TABLES 


Table 

Page 

1 

Auditory  Localization  Cues  and  Their  Properties 

22 

2 

Estimated  Standard  Deviation  of  Localization 
(In  Degrees)  Attributable  to  Six  Cues 

24 

3 

Input  Parameters  in  the  Master  Test  File 

93 

4 

Sound  Pressure  Level  Output  (in  dBA)  at  the 

Left  and  Right  Headphones  ae  a  Function  of 

Input  Level  and  Type  of  Signal 

109 

5 

Mean  Localization  Error  in  the  Psychological 

Fidelity  Experiment  as  a  Function  of  Type  of 

Source  and  Head  and  Listening  Conditions 

115 

6 

Azimuths  at  which  Sound  were  Presented  in  the 
Psychological  Fidelity  Experiment 

118 

7 

Number  of  Valid  Trials  in  the  Psychological 

Fidelity  Experiment  as  a  Function  of  Sound 

Azimuth  and  Head  and  Listening  Conditions 

121 

8 

RMS  Localization  Error  in  the  Psychological 

Fidelity  Experiment  as  a  Function  of  Type  of 

Source  and  Head  and  Listening  Conditions 

123 

9 

Msan  Response  Time  in  the  Psychological  Fidelity 
Experiment  as  a  Function  of  Type  of  Source  and 

Head  and  Listening  Conditions 

124 

10 

Mean  Localization  Error  in  the  Psychological 

Fidelity  Experiment  as  a  Function  of  Actual 

Source  Azimuth  and  Head  and  Listening  Conditions 

126 

11 

RMS  Localization  Error  in  the  Psychological 

Fidelity  Experiment  as  a  Function  of  Sound 

Azimuth  and  Head  and  Listening  Condi tiono 

127 

12 

Mean  Response  Time  in  the  Psychological  Fidelity 
Experiment  as  a  Function  of  Sound  Azimuth  and 

Head  and  Listening  Conditions 

128 

13 

RMS  Localization  Error  in  the  Psychological 

Fidelity  Experiment  as  a  Function  of  Listening 

Condition,  Blocks  of  Practice,  Sound  Azimuth, 
and  Head  Condition 

130 

8 


LIST  OF  TABLES 
(continued) 


Table 

14  Mean  Response  Time  in  the  Psychological  Fidelity 
Experiment  as  a  Function  of  Listening  Condition, 
Blocks  of  Practice,  Sound  Azimuth,  and  Head 
Condition 

13  Summary  of  Major  Results  of  the  Psychological 
Fidelity  Experiment 

16  Filtered  Noise  Stimuli  Used  in  Experiment  I 

17  Stimuli  for  Experiment  I  and  Localization  Cues 
Available 

18  RMS  Localization  Error  as  a  Function  of  Stimulus 
Noise  Type,  Source  Azimuth,  and  Day  ir.  Experiment  I 


19  Coefficients  for  Planned-Comparison  ANOVA's  in  141 

Experiment  I 

20  RMS  Localization  Error  as  a  Function  of  Stimulus  142 

Noise  Type  and  Day  in  Experiment  I 

21  RMS  Localization  Error  as  a  Function  of  Source  143 

Azimuth  and  Day  in  Experiment  I 

22  Mean  Localization  Error  as  a  Function  of  Source  145 

Azimuth  and  Day  in  Experiment  I 

23  Mean  Response  Time  as  a  Function  of  Stimulus  146 

Noise  Type  and  Day  for  Experiment  I 

24  Mean  Response  Time  as  a  Function  of  Source  147 

Azimuth  and  Day  for  Experiment  I 

25  Thresholds  for  Experiment  II  Stimuli  151 

26  Mean  Localization  Error  as  Function  of  153 

Stimulus  Variables  and  Source  Azimuth 

27  RMS  Localization  Error  as  a  Function  of  Burst  154 

Duration,  Duty  Cycle,  and  Source  Azimuth  for 

Experiment  II 

28  Linear  Component  of  the  Change  in  RMS  155 

Localization  Error  with  Source  Azimuth  as  a 

Function  of  Duty  Cycle  and  Burst  Duration 


Page 

131 

133 

137 

138 


140 


i 


9 


LIST  0?  TABLES 

(concluded) 


Table  Page 

29  RMS  Localization  Error  as  a  Function  of  Rise  157 

Tima,  Burst  Duration  and  Duty  Cycle 

30  RMS  Localization  Error  as  a  Function  of  Rise  15S 

Time,  Noise  Type,  and  Burst  Duration 

31  RMS  Localisation  Error  as  a  Function  of  Noise  160 

Type,  Burst  Duration,  and  Duty  Cycle 

32  Mean  Response  Time  as  Function  of  Source  Azimuth  161 

33  Mean  Response  Time  as  a  Function  of  Noise  Type  162 

and  Duty  Cycle 

34  Summary  of  Major  Results  of  Experiment  I  163 

35  Summary  of  Major  Resulta  of  Experiment  II  164 


10 


1.0  INTRODUCTION 


1.1  Background 

In  military  systems,  particularly  aircraft,  the  preponderance  of 
information  is  presented  to  the  human  operator  in  the  visual  morality.  During 
critical  periods  of  high  workload,  this  produces  an  overload  in  the  visual 
channel  which  can  reduce  pilot  and  system  effectiveness,  safety,  and 
survivability. 

The  visual  overload  problem  could  be  alleviated  by  making  better  use  of 
the  auditory  modality,  particularly  for  information  which  humans  customarily 
process  aurally  (c.f.,  Doll,  Folds,  &  Leiker,  1984;  Simpson,  1982;  Werkowitz, 
1981).  For  example,  in  the  natural  environment,  sounds  are  often  a  cue  as  to 
the  direction  of  important  objects  or  sources  of  information.  Upon  hearing  a 
novel  sound,  a  human  generally  rotates  his  or  her  head  and  eyes  toward  the 
source  and  acquires  it  visually.  Auditory  information  is  thus  a  natural  cue 
for  visual  acquisition  of  objects  in  the  environment. 

This  research  is  directed  toward  developing  ways  to  convey  accurate 
spatial  information  via  audio  signals  delivered  to  the  listener  through 
earphones  or  headphones.  Some  of  the  ways  in  which  such  simulated  directional 
audio  cues  could  be  used  to  enhance  the  performance  of  pilots  of  military 
aircraft  are  the  following: 

1.  Redirection  of  visual  attention 

Particularly  in  Head-Coupled  Control/Display  systems,  a  natural,  non 
workload-intensive  method  is  needed  to  cue  the  pilot  where  to  look. 

2.  Enhancement  of  situational  awareness 

Directional  audio  cues  could  be  used  to  help  keep  the  pilot  aware  of 
aircraft  attitude  in  low  visibility  conditions,  and  to  warn  of 
threats  or  closing  terrain  when  visual  attention  is  directed 
elsewhere. 

3.  Enhancement  of  communications  and  audio  warnings 

Giving  audio  messages  and  radio  communications  each  a  different 
apparent  direction  could  enhance  their  intelligibility  and  lessen 
pilot  response  time,  especially  in  noise  and  jamming  conditions. 


11 


1.2  Objectives 

The  specific  objectives  of  this  work  were  as  follows: 

1*  Design,  fabricate,  and  evaluate  an  apparatus  for  demonstrating 
simulated  auditory  localization  (SAL),  i.e.  auditory  localization 
based  on  simulated  cues  provided  via  headphones. 

2.  Perform  a  review  and  synthesis  of  the  research  literature  which 
provides  a  basis  for:  (a)  evaluating  the  feasibility  and  potential 
value  of  using  SAL  in  the  cockpit,  and  (b)  for  identifying  areas 
where  further  research  is  needed  to  develop  SAL  into  a  valuable 
pilot  aid. 

3.  Conduct  experimental  research  to  determine  characteristics  of  the 
auditory  stimulus,  in  the  time  and  frequency  domains,  which  enhance 
localization  performance  with  simulated  audio  cues. 

1.3  Summary  and  Conclusions 

Section  2.0  of  this  report  provides  an  extensive  review  of  the  research 
literature  on  auditory  localization.  The  first  part  of  the  review  focuses  on 
the  cues  involved  in  the  perception  of  sound-source  direction  and  distance 
relative  to  the  listener  when  the  listener's  head  is  stationary. 

The  experimental  evidence  indicates  that  at  least  five,  and  possibly  six, 
cues  play  a  role  in  the  perception  of  the  direction  of  a  sound  source  relative 
to  the  listener.  The  cues  are  known  to  the  extent  that  the  general  physical 
mechanism  which  produces  each  cue  (e.g. ,  the  pinnae  or  outer  ear)  and  the 
region  of  the  spectrum  in  which  each  cue  operates  can  be  specified.  The 
changes  in  the  acoustic  waveform  at  the  eardrums  or  ear  canal  entrance  as  a 
function  of  source  direction  have  been  precisely  measured  for  many  listeners, 
and  some  common  features  have  been  identified.  However,  such  waveforms, 
called  head-related  transfer  functions  (HRTF’s)  are  complex.  Consequently, 
there  exists,  to  date,  only  a  limited  understanding  of  what  features  of  the 
HRTF's  are  actually  used  by  the  ear  and  brain  as  cues  to  source  direction. 

In  the  study  of  auditory  distance  perception,  it  is  useful  to  distinguish 
between  absolute  and  relative  cues.  Absolute  cues  allow  the  listener  to  judge 
source  distance  in  the  absence  of  any  additional  information  about  the  source, 
e.g.,  its  spectral  composition  and  direction.  Absolute  cues  are,  of 
necessity,  binaural  or  involve  head  movement.  Initial  research  suggests  that 
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the  interaural  time  (or  phase)  difference  (TTD)  is  the  major  absolute  distance 
cue. 

Relative  cues  allow  the  listener  to  judge  the  absolute  distance  of  a 
sound  source  when  the  source  characteristics  (intensity  and  spectral 
composition)  are  known  or  assumed.  For  example,  one  can  judge  the  distance  of 
human  speakers  assuming  that  they  are  conversing  at  normal  conversational 
intensity.  Relative  cues  also  allow  a  listener  to  judge  whether  a  single, 
known  source  is  closer  or  further  away  on  two  successive  occasions.  Further 
research  is  needed  to  identify  all  the  cues  for  auditory  distance  perception, 
to  determine  how  they  interact,  and  to  quantify  how  they  vary  as  a  function  of 
source  distance. 

Section  2.5  reviews  research  on  the  roles  of  head  movement  and  vision  in 
auditory  localisation.  The  evidence  deriving  from  each  of  four  different 
theoretical  perspectives  is  reviewed  and  integrated  into  a  single  model  which 
encompasses  all  the  findings.  Auditory  localization  in  the  presence  of  head 
movement  and  vision  is  viewed  as  a  multiple-level  feedback  control  system. 
Visual  information,  prior  knowledge,  and  feedback  from  motor  activities 
(especially  eye  movements)  provide  important  inputs  which  help  to  control 
auditory  perception. 

Sections  2.6  and  2.7  review  the  research  on  the  perception  of  auditory 
motion  and  volume,  and  the  effects  of  noise  on  auditory  localization. 
Research  in  these  areas  is  still  in  its  infancy.  Since  these  topics  are 
crucial  to  applications  of  SAL  in  the  cockpit,  further  research  is  urgently 
needed.  Section  6.0,  the  long-term  research  and  development  plan,  presents  a 
program  of  proposed  research  in  this  topic  area. 

Sections  2.8  and  2.9  Include  a  discussion  of  possible  applications  of  SAL 
technology  in  aircraft  cockpits,  ths  potential  benefits  of  such  applications, 
and  topic  areas  in  auditory  localization  where  further  basic  research  Is 
needed. 

Another  major  objective  of  this  project  was  to  design,  build,  and 
evaluate  a  facility  for  producing  SAL  cues.  Section  3.0  describes  the  SAL 
research  facility,  now  in  operation  at  the  Georgia  Tech  Research  Institute 
(GTRI).  Section  4.0  reviews  a  series  of  tests  and  refinements  which  were  made 
to  the  SAL  facility  to  enhance  and  evaluate  its  psychological  fidelity.  It 
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was  concluded  that  the  SAL  facility  produces  a  high  fidelity  simulation  of 
normal,  free-field  auditory  localization. 

An  experiment  was  conducted  In  order  to  compare  localization  performance 
in  the  SAL  facility  (i.e. ,  with  simulated  cues)  to  that  in  normal,  unaided 
localization.  There  was  Initially  a  email  difference  in  localization  accuracy 
(about  3  degrees  RMS)  which  decreased  to  virtually  zero  after  about  one  hour 
of  practice.  The  average  time  to  localize  a  sound  source  with  simulated  cues 
was  slightly  longer  than  with  normal  cues  (about  3.4  sec  versus  2.5  sec),  and 
this  difference  was  still  present  after  an  hour  of  practice. 

Section  5.0  summarizes  research  which  examined  how  the  characteristics  of 
auditory  signals  in  the  time  and  frequency  domains  affects  their  usefulness  as 
directional  signals  in  the  cockpit.  Two  experiments  were  conducted  in  which 
human  listeners  localized  sounds  on  the  basis  of  simulated  cues  delivered  via 
headphones.  The  first  experiment  compared  localization  performance  in  the  SAL 
facility  for  high-  and  low-pass  filtered  noise  signals.  The  cut-off 
frequencies  of  the  stimuli  were  designed  to  pass  portions  of  the  spectrum 
associated  with  some  localization  cues,  while  rejecting  portions  associated 
with  others,  the  second  experiment  used  intermittent  signals  and  examined  the 
affects  on  localization  performance  of  burst  duration,  duty  cycle  (repetition 
rate),  and  rise  time  for  high-  and  low-frequency  noise  bursts.  In  contrast  to 
most  previous  studies  in  which  the  listener's  head  remained  stationary, 
stimulus  characteristics  had  relatively  little  effect  on  localization  speed 
and  accuracy.  It  is  concluded  that  modulation  of  the  received  sound  produced 
by  head  movement  allows  the  listener  to  judge  thci  source  direction,  and  that 
such  modulation  renders  cues  associated  with  the  spectral  and  time-domain 
composition  of  the  sound  much  less  Important  for  localization  than  whan  the 
head  la  stationary. 

Although  the  overall  effects  of  stimulus  characteristics  were  small,  the 
rise  time  and  repetition  rata  of  intermittent  stimuli  produced  trends  which 
have  significance  for  the  design  of  directional  auditory  displays. 
Spscifically,  shorter  rise  times  (e.g.,  1  msec)  and  higher  repetition  rates  (2 
Hz  and  greater)  produce  batter  localization  performance  than  longer  rise  times 
and  lower  repetition  rates. 

Overall,  the  results  of  the  experiments  suggest  that,  when  the  listener 
can  move  the  heed,  localization  performance  with  simulated  cues  Ls  relatively 
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Insensitive  to  the  character  of  the  audio  signal.  Thus  localization 

performance  should  not  be  seriously  degraded  for  nonoptiraal  audio  signals  such 
as  tones  and  speech,  as  long  as  the  listener's  head  Is  free  to  move* 

The  last  section  of  this  report  (6.0)  presents  a  long-term  research  and 
development  plan  for  an  electronic  simulator  which  could  convey  highly 
accurate  directional  Information  by  way  of  audio  signals  to  a  listener  wearing 
earphones.  The  simulator  will  be  suitable  for  airborne  applications  and  will 
have  the  capability  to  Impress  directional  qualities  on  incoming  signals  in 
real  time.  The  simulator  will  also  be  able  to  alter  the  sound  quality  in  real 
time  in  a  manner  coordinated  with  the  listener's  head  movement.  This  is 
necessary  in  order  to  make  a  sound  appsar  to  be  stationary  in  space  as  the 
listener  moves  the  head,  and/or  to  simulate  a  moving  sound  source.  Section 
6.2  examines  alternative  approaches  for  building  a  real-time  directional 
synthesizer  for  audio  signals,  and  Section  6.3  outlines  the  major  technical 
Issues  to  be  resolved.  Section  6.4  outlines  a  research  and  development  plan 
for  building  and  tasting  a  prototype  of  the  real-time  directional  synthesizer. 


2,0  LITERATURE  REVIEW  AMD  RECOMMENDATIONS 


2.1  Introduction 

This  section  Include*  *  review  of  the  basic  research  literature  on 
auditory  localisation  and  of  on*  major  attempt  to  use  auditory  localization  in 
a  human-machine  system  (Garner,  1949),  The  purpose  of  this  review  is  three¬ 
fold,  First,  the  review  provides  a  basis  for  evaluating  the  feasibility  of 
using  simulated  auditory  localisation  (SAL)  to  provide  directional  cues  in 
head^coupled  control/'dlfcplay  systems,  A  second  purpose  of  this  review  la  to 
Identify  other  potential  uses  of  SAL  in  human-machine  systems,  and  to  assess 
the  feasibility  of  each  such  use.  The  third  objective  is  to  identify 
requirements  for  further  basic  research  which  would  facilitate  the  development 
of  SAL  applications  in  human-machine  systems. 

The  first  topic  addressed  in  this  review  is  the  acoustic  cues  which 

enable  humans  to  perceive  the  location  of  a  sound  sourcs  when  the  listener's 
head  is  immobile.  Specifically,  the  first  three  subsections  of  this  review 
address  cues  to  sound-source  positions  in  the  horizontal  plane,  the  median 
plana,  and  cues  to  distance.  The  next  three  subsections  address  the  effects  of 
head  and  eye  movement  on  auditory  localisation,  the  perception  of  auditory 
motion  and  volume,  and  the  effects  of  noise  on  auditory  localization.  The 
final  two  subsections  discuss  possible  applications  of  SAL  in  the  cockpit  and 
present  recommendations  for  needed  basic  research  on  auditory  localization. 
An  annotated  bibliography  is  presented  in  Appendix  A. 

Throughout  this  report,  "horizontal  plana"  means  the  horizontal  plane 

passing  through  the  listener's  interaural  axis.  The  term  "median"  plane 
refers  to  the  median  sagittal  plane,  or  the  plane  of  symmetry  of  the  body. 
The  term  "cue"  refers  to  the  physical  characteristics  of  the  proximal  acoustic 
stimulus  at  the  listener's  ear  canal  entrance  or  eardrum  which  allow  him  or 
her  to  locate  the  source  in  space, 

2.2  Localization  in  the  Horizontal  Plane 

For  many  years,  "duplex"  theory  dominated  thinking  and  research  on 
auditory  localization.  Duplex  theory  holds  that  two  types  of  acoustic  cues, 
interaural  time  difference  (ITD)  and  interaural  amplitude  difference  (IAD) , 

account  for  localization  in  the  horizontal  plane.  Even  some  relatively  modern 
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sources  discuss  ITD  and  IAD  cues  extensively,  but  make  little  or  no  mention  of 
other  important  cues.  The  persistence  of  duplex  theory  is  probably  related  to 
the  fact  that  most  auditory  localization  research  has  been  performed  with  the 
stimulus  presented  via  headphones.  This  method  allows  the  experimenter  to 
freely  manipulate  the  proximal  stimulus  at  each  ear,  but  also  eliminates  cues 
due  to  head  movement  and  reflections  and  resonances  of  the  pinnae  and  torso. 

It  is  only  relatively  recently  that  the  cues  necessary  for  localization 
in  the  horizontal  plane  have  been  understood  well  enough  to  obtain  accurate 
localization  performance  from  a  listener  wearing  earphones.  Garner  (1949) 
summarized  several  unsuccessful  attempts  to  simulate  auditory  localization  for 
submarine  applications  as  follows: 

"When  the  problem  of  auditory  signaling  is  mentioned,  usually 
the  first  thing  that  comes  to  mind  is  the  use  of  some  indication  o-f 
lateral  displacement  of  a  sound  source.  If  time,  intensity,  and 
phase  differences  between  the  two  ears  are  the  cues  we  use  in 
localizing  a  sound  source,  then  it  should  be  simple  to  produce  an 
apparent  displacement  of  a  sound  source  by  stimulating  the  two  ears 
differently  in  one  of  these  respects.  Unfortunately,  it  is  not  as 
simple  as  that,  (p.212) 

...  Whatever  the  outcome  of  this  type  of  research,  it  is  clear 
that  more  research  is  needed  before  it  will  be  possible  to  simulate 
good  localization  of  sounds."  (p.  213) 

When  ITD  and  IAD  cues  are  simulated  and  presented  dichotically ,  via 
earphones,  the  resulting  sound  is  heard  as  if  it  were  inside  the  head.  Under 
these  conditions,  the  sound  can  be  made  to  subjectively  appear  to  the  listener 
as  though  it  were  located  at  various  positions  between  his  ears.  However,  the 
sound  always  remains  subjectively  within  the  head;  hence  this  type  of 
experiment  is  called  lateralization.  The  ability  to  judge  the  corresponding 
position  outside  the  head,  i.e.,  to  localize,  is  quite  poor  in  these 
conditions  (Mills,  1972). 

Batteau  and  his  colleagues  (Batteau,  1967,  Batteau,  Plante,  Spencer  & 
Lyle,  1963;  1965)  were  apparently  the  first  to  demonstrate  auditory 
localization  performance  using  earphones  with  accuracy  comparable  to  that 
achieved  with  the  unaided  ears.  Previous  researchers,  dating  back  to  the  19th 
Century,  had  speculated  that  the  external  ears,  or  pinnae,  play  a  role  in 
localization  by  altering  the  quality  of  the  sound  depending  on  its  direction 
of  origin  (Shaw,  1982a).  Bloch  (1893)  and  McLean  (1959)  demonstrated  that 
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distortion  of  the  pinnae  impairs  the  ability  to  localize  sounds  (cited  in 

Shaw,  1982b  and  Batteau,  1967,  respectively)*  Angell  and  Fite  (1901a, b) 

demonstrated  that  monaural  localization  is  possible,  although  not  with  the 
accuracy  of  binaural  stimulation,  and  they  suggested  that  the  pinna  plays  a 
role  In  localization  (cited  by  Mills,  1972  and  Gatehouse,  1982a)* 

The  method  used  by  Batteau  et  al.  (1965)  to  demonstrate  simulated 

auditory  localization  is  shown  In  Figure  1*  A  listener  was  stimulated 

dlohotlcally  using  semi-insertion-alectroatatic  headphones  with  the  signals 
recorded  from  a  pair  of  artificial  pinnae*  The  pinnae  were  mounted  on  high- 
fidelity  microphones,  which  were  attached  to  a  bar  and  separated  so  as  to 
correspond  to  the  width  of  a  human  head.  These  artificial  "ears"  were  located 
in  a  separate  room,  acoustically  Isolated  from  the  listener.  The  listener 
reported  the  apparent  azimuth  of  a  maraca  shaken  at  various  positions  around 
the  artificial  ears*  Tha  listener's  head  was  restrained  during  the  testing. 
When  the  artificial  pinnae  were  attached  to  the  microphones,  the  listener  was 
able  to  localize  tha  maraca  with  excellent  accuracy  In  both  azimuth  and 
elavation*  Without  the  artificial '  pinnae,  the  listener's  judgements  ware 
erratic  and  inaccurate*  Interestingly,  with  the  artificial  pinnae  attached  to 
the  microphonaa,  the  listener  reported  that  the  sound  appeared  subjectively  to 
be  located  some  distance  outside  tha  head,  rather  than  in  the  head,  as  sounds 
presented  via  earphones  are  normally  perceived. 

Since  the  Batteau  at  al.  (1965)  demonstration,  investigators  have 

continued  to  analyze  tha  cues  necessary  for  auditory  localization.  More 
recent  findings  suggest  that  auditory  localization  is  quite  complex,  depending 
on  as  many  as  six  different  physical  cues  whose  influence  varies  depending  on 
the  location  of  the  sound  source  relative  to  the  listener's  head  and  the 
frequency  of  the  stimulus*  A  number  of  investigators  have  measured  the 
transfer  functions  which  describe  how  the  spectrum  of  the  free-field  sound  is 
related  to  that  of  the  proximal  stimulus  at  the  ear  canal  entrance  or  at  the 
eardrum  (c*f«,  Flannery  &  Butler,  1981;  Gardner,  1973;  Mehrgardt  &  Mellert, 

1977;  Rodgers,  1981;  Shaw,  1974b;  Shaw  &  Taranishi,  1968;  Weinrich,  1982), 

Shaw  (1982a)  compiled  a  family  of  curves  representing  the  average 

transformation  of  sound  pressure  level  (SPL)  from  the  free  field  to  the  human 
eardrum  for  100  subjects  studied  in  12  separate  studies*  These  curves,  called 
Head-Related  Transfer  Functions  (HRTF's),  are  reproduced  in  Figure  2  and  ahow 
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Figure  1.  Method  used  by  Battaau  at  al.  (1965) 
to  simulate  auditory  localisation. 
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TRANSFORMATION  OF  SOUND  PRESSURE  LEVEL  FROM  FREE  FIELD  TO  EARDRUM  -  DECIBELS 


Figure  2.  Average  transformation  of  sound  pressure 
level  from  free  field  to  human  eardrum  as 
a  function  of  frequency  at  24  angles  of 
incidence  in  the  horizontal  plane.  (From 
Shaw,  1982b.) 
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SPL  at  the  eardrum  as  a  function  of  frequency  for  24  angles  of  incidence! 
There  are  clear  differences  as  a  function  of  angle*  Notice  the  substantial 
difference  for  sounds  in  the  anterior  and  posterior  directions  at  comparable 
angles.  For  example,  the  difference  In  SPL  at  4  kHz  for  sounds  located  at  45 
degrees  and  135  degrees  azimuth  la  about  10  db.  Shaw  concluded  that  the 
external  ear  produces  an  abundance  of  direction-dependent  cues  which  could 
serve  as  the  sole  basis  of  auditory  localization*  However,  it  is  well  known 
that  binaural  localization  is  more  accurate  than  monaural  localization,  hence 
lnteraural  differences  must  also  play  an  Important  role. 

Searle,  Braids,  Davis  &  Colburn  (1976)  developed  a  model  of  the  decoding 
process  in  auditory  localisation  based  on  statistical  decision  theory,  using 
data  from  studies  of  horizontal  and  vertical  localization  reported  over  a  ten- 
year  period.  They  conclude  that  there  are  six  cues  which  the  ear  and  brain 
uses  to  decode  the  direction  of  a  sound,  and  that  monaural  pinna  cues  are  of 
less  importance  than  lnteraural  pinna  cues  (see  Section  2.3.1  for  further 
discussion  of  this  point).  The  six  cues  and  their  properties  are  listed  in 
Table  1.  Figure  3  defines  the  angular  coordinates  used  in  Table  1.  Searle  at 
al.  (1976)  used  a  nonlinear  least-squares  procedure  to  estimate  the  standard 
deviation  of  localisation  possible  on  the  basis  on  each  cue  alone*  The 
estimates,  based  on  data  from  18  experiments,  are  shown  in  Table  2.  These 
estimates  are  for  binaural  localisation  in  the  horizontal  plana  with  a  broad¬ 
band  white  noise  stimulus  and  a  speaker  array  spanning  90*.  Although  1TD  and 
IAD  arc  among  the  most  important  cues,  it  is  clear  that  pinna  and  torso  cues 
also  play  a  role. 

Measurements  of  auditory  acuity  provide  another  kind  of  evidence  that 
localization  performance  may  be  based  on  several  different  types  of  cues. 
Mills  (1958)  measured  the  minimum  audible  angle  (MAA)  for  pure  tones  as  a 
function  of  frequency  and  the  direction  of  the  initial  sound.  The  listeners 
judged  whether  two  successive  sounds  came  from  the  same  or  different 
positions.  A  loudspeaker  was  sounded  at  an  initial  azimuth  value,  and  then 
moved  through  a  small  angle  to  a  second  position.  The  subject's  head  was 
restrained.  The  MAA  is  that  separation  between  the  first  and  second  speaker 
positions  at  which  the  subject  correctly  judged  whether  the  second  sound  came 
from  the  left  or  right  of  the  first  sound  on  75%  of  the  trials.  The  results 
are  shown  in  Figure  4.  The  MAA  is  about  1*  for  low  frequency  tones  directly 
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TABLE  1 


Auditory  Localization 

Cues  and  Their  Properties* 

Cue 

number 

Deaerlption 

Assumed 
spatial  . 
dependence® 

Useful 

frequency 

range 

A  priori 
spectral 
information 
required? 

1 

Interaural  time  delay 

6 

20  Hs-12  kHz 

No 

2 

Interaural  amplitude  dif¬ 
ference  arising  from 
head  shadow 

e 

1-12  kHz 

No 

3 

Monaural  head  ahadow 

e 

1-12  kHs 

Yes 

4 

Interaural  pinna  amplitude 

rasponsa 

fl.Y 

4-12  kHz 

No 

3 

Monaural  pinna  amplitude 

response 

®»Y 

4-12  kHz 

Yes 

6 

Amplitude  response  due  to 
shoulder  reflections 

e.Y 

2-3  kHz 

Yes 

■Adapted  from  Searle  «t  el.  (1$76). 

bSaa  Figure  3 


TABLE  2 


Estimat  i  Standard  Davlatlon  of  Localization  (In  Degrees) 
Attributable  to  Six  Cues* 


Cue  Description 

Standard  Deviation 
of  Localization^ 

Interaural  time  delay 

Interaural  heed  shadow 

►  3  5.6° 

Monaural  head  shadow  _ t 

Intaraural  pinna 

18. # 

Monaural  pinna 

23." 

Shoulder  bounce 

67.* 

*Adapatad  from  Saar la ,  at  al.  (1976) 

The  expected  value  of  the  atandard  deviation  of  localization  judgements 
assuming  only  the  corresponding  cue(a)  are  available. 

3 

The  standard  deviation  of  these  three  cues  could  not  be  estimated  separately, 
based  on  the  data  uaed. 
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Figure  4.  Minimum  audible  angle  between  aucceaelve 
pulees  of  tone  ee  a  function  of  the  fra~ 
quency  of  the  tone  and  the  direction  of 
the  eource  (•:()*;  •  : 30*  ;  ▲ : 60*  ;  A;75*). 
(From  Mills,  1972.) 


* 
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in  front  of  the  listener.  The  MAA  increases  as  the  azimuth  of  the  initial 
sound  increases,  to  the  point  where  it's  Indeterminately  large  at  some 
azimuths  and  frequencies.  The  curves  for  the  0*  and  30°  conditions  in  Figure 
4  exhibit  two  zones  in  which  acuity  deteriorates  and  then  improves  again  with 
increasing  frequency  (from  about  1200  to  2000  Hz  and  again  in  the  region 
around  8  kHz),  These  zones  may  represent  transitions  from  dependence  on  one 
type  of  cue  to  another.  These  results  suggest  that  horizontal-plane 
localization  is  based  on  three  or  more  types  of  acoustic  cues. 

In  the  following  section  is  examined  the  available  evidence  for  five  of 
the  six  suspected  cues  for  horizontal-plane  localization.  The  review  covers 
the  physical  characteristics  of  each  cue,  its  function  in  terms  of  auditory 
localization  performance,  and  what  little  is  known  about  the  interaction  of 
cues.  Discussion  of  the  sixth  cue,  binaural  pinna  disparity,  is  covered  in 
the  Section  2,3,  which  concerns  median-plane  localization, 

2,2,1  Interaural  Time  Differences 

Depending  on  its  angle,  6,  with  respect  to  the  listener's  head,  the  sound 
from  a  distant  source  arrives  at  one  ear  before  it  arrives  at  the  other.  A 
simple  geometrical  modal  of  the  difference  in  the  path  that  the  sound 
traversas  to  one  ear  versus  the  other  is  shown  in  Figure  5.  The  path  length 
difference  is: 


dd  -  r  <0  +  sin  0)  (1) 

Where  0  is  measured  in  radians  and  r  is  the  radius  of  the  head,  approximately 
8,75  cm.  The  ITD  for  a  sound  at  angle  0  is  then: 

(2) 

V.'hers  c  is  the  speed  of  sound,  about  343  m/s. 

The  ITD  varies  with  the  direction  of  the  sound  relatlvs  to  the  head  as 
shown  in  Figurs  6.  Tha  dotted  lines  represent  two  different  sets  of 
predictions.  The  lower  dotted  line  is  derived  from  equations  (1)  and  (2). 
This  line  fits  the  measured  ITU's  for  high  frequency  and  broad-band  sounds 
quite  well.  The  upper  dotted  curve  is  derived  from  the  equation: 
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Figure  5.  Diff trances  between  the  distances  of 
the  ear*  from  a  eource  of  sound  far 
enough  away  to  produce  a  nearly  plane 
wave  front.  (From  Mills,  1972.) 


0 


30 


Kdagrtas) 


Figure  6.  Intaraural  elm*  delay  for  tones  and  clicks 
as  a  function  of  aalmuth,  $  , 

(From  Durlach  &  Colburn,  1978.) 
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(3) 


At*  -  —  -  -  (3  tin  ©) 
c  c 

which  is  based  on  diffraction  theory.  It  provides  a  rough  approximation  for 
low  frequency  sounds. 

Kuhn  (1977)  developed  a  model  of  ongoing  ITD's  which  provides  a  better 
prediction  of  ITD's  for  low  frequency  sounds.  He  compared  the  model 
predictions  to  measured  ITD's  on  an  snthroporaetrically-scalsd  manikin  with 
pinnae.  The  results  are  shown  in  Figure  7*  The  1TD  is  greatest  at  low 
frequencies  (<  500  Hs)  and  roughly  frequency-independent  below  that  value. 
The  ITD  is  smeller,  end  egeln  frequency-independent  above  about  3.0  kHe.  For 
angles  less  than  60*  from  the  median  plane,  ITD  was  smallest  between  1.4  and 
1.6  kHz  (and  just  slightly  less  than  the  ITD  at  3.0  kHz).  Kuhn  also  noted 
that  the  rate  of  change  of  ITD  with  angle  of  incidence  is  greatest  near  the 
median  plane  (l.e. ,  when  the  sound  comes  from  in  front  of  the  listener). 
Therefore,  the  change  in  ITD  with  head  rotation,  if  permitted,  will  be 
greatest  when  the  source  is  located  in  the  frontal  region. 

At  low  frequencies  (<  1500  Hs),  the  ears  are  sensitive  to  both  ongoing 
ITD's  (or  lnteraural  phase  differences  -  IPD's)  and  to  transient  ITD's.  At 
high  frequencies  the  ears  are  sensitive  only  to  transient  ITD's. 

For  any  ITD,  ongoing  or  transient,  there  are  always  two  source  positions 
which  could  have  produced  the  ITD,  one  in  front  of  the  observer  and  the  other 
behind.  Note  in  Figure  8  that  for  source  position  B,  the  path  lengths  to  the 
left  and  right  ears  are  identical  to  those  for  source  position  A.  This  is  an 
idealization  since  real  heads  are  not  circular  and  the  ears  are  located  more 
than  90s  from  the  front  of  the  median  plane.  For  real  observers,  the  back 
source  position  producing  the  same  ITD  will  not  be  exactly  180*  minus  the 
front  position.  Listeners  frequently  make  front-back  reversals  in  judging 
sound  source  location,  especially  when  pinna  cues  ere  not  available  (cf. , 
Musicant  &  Butler,  1984a).  Front-back  ambiguities  are  discussed  further  in 
Section  2.2.3. 

Additional  ambiguities  occur  for  ongoing  ITDs  produced  by  pure  tones 
whenever  the  ITD  is  equal  to  or  larger  than  one-half  the  period  of  the 
acoustic  event.  When  this  condition  occurs,  the  listener  will  not  be  able  to 
distinguish  whether  the  source  is  located  on  the  left  side  of  the  head,  or  at 
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Figure  7.  Measured  end  predicted  lnteraural 
tine  diiays  with  no  toreo  ae  a 
function  of  angle  of  incidence 
•  •  •measurements  at  360 


the  mirror  image  position  or  the  right.  Figure  9  shows  a  time  line  and 
represents  the  ITD  as  an  interval  on  the  line.  Suppose,  as  shown,  that  a 
sound  coming  from  the  left  side  of  the  listener's  head  has  a  half-period  equal 
to  the  ITD.  Then,  at  a  given  moment  in  time,  the  left  ear  (L)  is  located  at  a 
pressure  maximum  whan  tha  right  ear  (R)  is  at  a  minimum.  Now  consider  a  sound 
coming  from  the  right  side  of  the  head  which  is  1 80®  out  of  phase  with  the 
sound  coming  from  the  left.  It  also  produces  a  maximum  at  L  when  there  is  a 
minimum  at  R.  The  listener  cannot  perceive  the  absolute  phase  of  a  sound; 
hanca  ha  or  aha  cannot  distinguish  such  a  sound  coming  from  the  left  from  a 
sound  from  the  right. 

The  difference  in  the  acoustic  path  length  to  the  two  ears  is  greatest 

t 

when  the  sound  epproechae  from  0  ■  90*.  This  condition  therefore  produces 
ambiguous  ITD's  starting  at  the  lowest  frequency  of  any  direction  of 
arrival.  For  such  a  sound,  the  shortest  aerial  path  from  ear  to  ear  is  about 
23  cm.  Thie  distance  exceeds  half  a  wavelength  (and  therefore  ITD  exceeds 
half  a  period)  at  frequencies  of  about  750  Ha  and  graatar. 

Since  the  ITD  decreases  ee  the  direction  of  the  sound,  0,  approaches  0° 
or  180*,  higher  frequency  tones  produce  unambiguous  ITD's  in  Chess  regions. 
For  example,  at  0  «  +_45,  tones  of  frequency  up  to  1300  Hs  produce  an 
•mamblguoua  ITD  (tha  llstaner  can  distinguish  vhsthar  the  sound  is  coming  from 
the  left  or  right  side  of  the  head,  but  not  nacassarlly  from  front  versus 
back).  Physiological  evidence  indicates  that  individual  neurons  of  the 
auditory  nerve  can  fire  in  synchrony  with  e  periodic  sound  source  only  up  to 
about  1000  Hs,  due  to  the  finite  refractory  period  of  the  neuron.  Above  that, 
bundles  of  neurons  can  follow  a  pariodlc  sound  only  up  to  about  1600  Hz 
(Taaakl,  1934),  Tharefora,  no  mattar  how  small  the  ITD,  the  ear  can  only 
follow  the  intereurel  time  (or  phase)  difference  created  by  a  periodic  source 
up  to  about  1600  Hs. 

Flgura  10  shows  tha  physical  IPD'e  and  IAD'e  produced  by  moving  a  source 
a  juet-noticeable-distance  from  tha  median  plane.  Also  shown  are  the 
thresholds  for  IPD  and  XAD  for  tone  pulses  presented  dichotically,  via 
earphones.  The  intereurel  phase  juet-noticeable  difference  (IPjnd),  or 
threshold,  and  the  intcreural  amplitude  juet-noticeable  difference  (IAjnd)  are 
greater  then  the  physical  IPD  and  IAD  because  the  threshold  (jnd)  values  of 
IPD  end  IAD  were  taken  as  those  values  that  the  listener  correctly  detected  on 
731  of  the  presentations  (see  Mills,  1972). 
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SOUND  COMII 


•  IDS  OV  HKAD 


PATH  TIMS 
DIPPININCI 
PNOMSOUNCI 
TO  LAS 


SOUND  COMINO  PNOM  NIOHT  SIDS  OP  HSAD 


Figure  9.  Monantary  sound  pressure  level  at  the  left  (L) 
and  right  (R)  ears.  See  text  for  explanation. 
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auditory  localization 


Figure  10.  Comparison  of  the  intsraural  differences  in  phase 
and  intensity  that  can  just  be  detected  whan  tone 
pulses  are  presented  through  earphones  with  the 
intsraural  differences  in  phase  and  in  intensity 
that  are  present  when  an  actual  source  of  tone 
pulses  Is  moved  just  noticeably  out  of  the  median 
plane.  (From  Mills,  1972.) 
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The  important,  feature  of  Figure  10  is  that  the  IPjnd  follows  the  physical 
IPD  up  to  about  1200  He  and  then  becomes  indeterminately  large.  At  about  1400 
Hz,  the  IAjnd  converges  with  the  physical  IAD  and  remains  close  to  it  up  to 
about  6000  Hz.  Above  6000  Hz,  the  physical  IAD  becomes  much  larger  than  the 
IAjnd.  These  data,  along  with  the  transition  from  1200  Hz  to  about  2000  Hz 
seen  previously  in  Figure  4,  suggest  that  localization  of  pure  tones  is  based 
on  IPD  up  to  about  1200-1500  Hz  and  on  IAD  above  1500  Hz.  The  data  in  Pigure 
10  provide  no  explanation  for  the  transition  seen  around  8  kHz  in  Figure  4. 

Although  the  auditory  system  is  insensitive  to  ITD  for  pure  tones  at  high 
frequency,  It  is  sensitive  to  ITD  for  some  complex  high-frequency  sounds. 
Sensitivity  to  ITD  for  high  frequency  sounds  has  been  demonstrated  when  either 
of  two  conditions  are  met:  (1)  the  sound  has  a  low  frequency  periodic 

envelope,  or  (2)  the  sound  includes  a  transient  component. 

A  study  by  McFadden  &  Pasanen  (1976)  provides  an  example  of  the  latter 
condition.  Using  dlchotic  headphone  presentation,  they  measured  the  threshold 
for  80X  correct  lateralization  of  a  two-tone  complex.  The  tones  were  both  in 
the  4000  Hz  region,  and  differed  in  frequency  by  25  to  550  Hz  in  different 
conditions.  An  lnteraurally  uncorrelated,  low-pass  noise  was  used  to  mask  the 
difference  tone  created  by  the  mixing  of  the  two  primary  tones.  The  complex 
tone  was  presented  for  200  msec  with  25  msec  rise  and  decay  times.  Listeners 
detected  ITD's  as  small  as  27  Msec,  based  apparently  on  the  transient 
(envelope)  features  of  the  stimulus. 

The  former  condition  for  sensitivity  to  ITD's  at  high  frequency  is 
illustrated  by  a  study  by  Yost,  Wlghtman,  and  Green  (1971).  listeners  were 
asked  to  discriminate  between  high-  and  low-pass  filtered  clicks  which  were 
presented  simultaneously  (diotlcally)  or  at  a  time  offset  (dichotically) . 
When  a  simple  click  was  presented,  high-pass  clicks  were  more  difficult  to 
lateralize  than  were  low-pass  clicks.  However,  when  the  clicks  were  repeated 
64  times  during  the  observation  Interval,  high-  and  low-pass  clicks  were 
lateralized  equally  well.  Apparently,  the  auditory  system  is  sensitive  to  the 
ongoing  ITD  produced  by  a  low-frequency  periodic  envelope. 

The  detectability  and  usefulness  of  ITD's  produced  by  transient  sounds  is 
not  limited  by  the  refractory  period  of  the  neurons  or  phase  ambiguities. 
Hence  transient  ITD's  are  effective  even  at  frequencies  ss  high  as  5000  Hz 
( cf . ,  Yost  et  al. ,  1971). 
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2.2.2  Interaural  Amplitude  Differences 

The  interaural  amplitude  difference  produced  by  the  head  shadow  varies  as 
a  function  of  the  frequency  of  the  sound  as  well  as  its  direction.  Figure  11 
•howa  the  1AD  measurements  for  pure  tones  on  five  human  subjects.  As 
expected,  the  IAD  is  near  zero  at  0°  azimuth,  and  reaches  a  maximum  between 
60 4  and  120*.  At  frequencies  less  than  1800  Hz,  the  IAD  is  relatively  small 

for  all  angles  of  incidence.  At  2500  Hz  the  IAD  reaches  a  maximum  of  about 

12  db  near  80*.  As  the  frequency  increases,  the  IAD  approaches  20  db  at  some 
angles.  These  data  are  in  agreement  with  Figure  10,  which  shows  that  the 
lAJnd  and  the  actual  IAD  produced  by  moving  a  source  near  the  median  plane 
follow  ons-another  closely  from  near  1500  Hz  to  about  6000  Hz, 

Another  important  feature  of  the  IAD  curves  in  Figure  11  is  that  the  rate 

of  change  of  IAD  with  azimuth  la  greatest  near  0°  and  generally  flattens  out 

as  the  source  moves  to  the  side  of  the  head.  Therefore,  any  rotation  of  the 
sound  source  or  the  head  in  the  horizontal  plane  should  produce  a  much  greater 
effect  near  04  than  at  larger  angles.  This  finding  agrees  well  with  Figure  4, 
which  shows  that  the  mlnimua  audible  angle  increases  rapidly  as  the  azimuth  of 
the  source  increases. 

As  noted  earlier  in  the  discussion  of  Figure  10,  at  frequencies  greater 
than  6000  Hz,  the  close  correspondence  between  IAjnd  and  the  physical  IAD 
ends.  Blauert  (1983)  found  that  IAD  changes  erratically  as  a  function  of 
frequency  above  6000  Hz.  Other  evidence  (see  section  2.2.3)  suggests  that 
spectral  changes  produced  by  the  pinnae  may  be  the  dominant  cues  at  the  higher 
frequencies. 

2.2.3  Monaural  Cues 

Interaural  time  and  intensity  differences  enable  the  listener  to 
discriminate  sounds  coming  from  the  right  vs.  left  side  of  the  head,  but  do 
not  differentiate  sounds  in  front  from  their  mirror-image  position  in  back  of 
the  head.  There  is  now  evidence  that  resonances,  reflection,  and  diffraction 
from  the  pinnae,  torso,  and  head  serve  as  monaural  cues  which  enable  the 
listener  to  make  front-back  discriminations.  These  anatomical  features 
produce  direction-dependent  changes  in  the  spectrum  of  the  sound  received  at 
the  eardrums. 


Figure  11.  Interaural  intensity  differences  as 
a  function  of  the  direction  of  the 
source  and  the  frequency  of  the  souno. 
(From  FedderBen  Sandel,  Teas,  and 
Jeffress,  1957.) 


In  order  for  moneural  cues  to  be  effective  aids  to  localization,  the 
listener  must  have  a  priori  information  about  the  spectrum  generated  by  the 
source*  Only  then  can  the  listener  judge  whetner  a  given  spectral  feature  is 
a  function  of  the  source  position,  or  a  feature  of  the  free-field  source 
spectrum*  Such  a  priori  information  can  be  gained  through  pre-experimental 
knowledge  or  repeated  presentations  of  the  source. 

The  monaural  cues  seem  to  consist  of  relative  amplification  of  one  region 
of  the  spectrum  relative  to  another  region*  If  the  source  spectrum  Is  limited 
to  a  very  narrow  bandwidth,  as  it  is  for  pure  tones,  then  little  or  no 
relative  amplification  is  possible.  If,  further,  the  amplitude  of  the  source 
is  varied  from  presentation  to  presentation,  any  noticeable  attenuation  or 
amplification  of  the  sound  from  one  exposure  to  the  next  Is  no  longer  a 
useable  cue  for  localization. 

Stevens  and  Newman  (1936)  found  that  tone  bursts  coming  from  in  front  of 
the  head  wars  often  perceived  as  coming  from  the  back.  When  the  Intensity  of 
the  sound  was  also  varied  randomly  from  one  presentation  to  the  next, 
listeners  performed  at  nearly  chance  level*  On  the  other  hand,  Musicant  & 
Butler  (1984a)  found  that  wide-bandwidth  noise  stimuli  were  localized  with 
very  few  front-back  reversal  errors  whan  thay  contained  significant  energy 
from  1  to  4  kHz  and  greater.  However,  when  the  listener's  pinnae  were 
occluded  in  the  same  study,  the  percentage  of  front-back  reverals  increased 
dramatically  for  both  wide-band  noise  and  for  4  kHz  high-pass  noise,  but  not 
for  4  kHz  low-pass  noise.  These  results  strongly  suggest  that  the  pinnae 
produce  effective  localization  cues  at  frequencies  of  4  kHz  and  greater. 

In  the  Musicant  and  Butler  study,  when  the  pinnae  were  not  occluded,  the 
4  kHz  low-pass  noise,  the  4  kHz  high-pass  noise  and  the  wide-band  noise  all 
produced  vary  few  front-back  reversals,  whereas  the  1  kHz  low-pass  noise 
produced  front-back  reversals  on  about  50  percent  of  the  trials.  This 
suggests  that  some  cues  also  operate  in  the  region  front  I  to  4  kHz  to  reduce 
front-back  reversals.  When  the  pinnae  were  occluded,  the  number  of  front-back 
reversals  increased  in  the  wide-band  and  4  kHz  high-pass  noise  conditions,  but 
not  for  the  4  kHz  low-pass  noise  condition.  This  result  suggests  that  the 
pinnae  produce  cues  only  above  4  kHz,  and  that  other  anatomically-related  cues 
(such  as  torso  reflection  and  monaural  head  shadow)  operate  in  the  l  to  4  kHz 
band  to  help  resolve  front-back  ambiguities,  Gardner  (1973)  reported  acoustic 
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data  (HRTF's)  for  manikins  with  and  without  a  torso.  The  torso  clearly 

produced  amplitude  variations  in  the  region  from  0.7  to  3.5  kHz. 

As  mentioned  earlier,  a  number  of  acoustic  investigations  have  been  done 
to  directly  measure ■  the  spectral  changes  produced  by  the  pinna,  head,  and 
torso  (Gardner,  19713;  Hebrark  6  Wright,  1974b;  Mehrgardt  &  Mellart,  1977; 
Rbdgers,  1981;  Shai^,  1974a;  1974b;  Shaw  &  Teraniahl,  1968;  Walnrelch,  1982). 

thd  resulting  HRTF's  show  the  ratio  of  sound  pressure  amplitude  (and/or 

difference  ip  phase)  at  the  ear  drum  or  ear  canal  entrance  versus  the 
amplitude  or  phasq  in  the  free  field  (when  the  listener  is  absent).  As  noted 
earlier,  there  are  numerous  large  differences  in  amplitude  response  at 
particular  frequencies  for  sound  sources  in  front  versus  the  mirror-image 
position  in  back  which  could  serve  as  localization  cues.  Shaw  (1974b) 
j)jresents  amplitude  HRTF's  normalized  to  0*  azimuth,  which  he  calls  "azimuthal 
dependence”  curves  (see  Figure  12).  Careful  inspection  of  these  curves 
^uggfcsts  possible  acoustic  features  which  could  play  a  role  in  front-back 
discrimination  and  monaural  localization  cues  generally.  He  notes  that  the 

overall  gain  (relative  to  0°)  Increases  smoothly  from  -45°  to  +45*  azimuth. 
From  60*  to  120*  the  gain  remains  about  constant,  except  in  the  2-6  kHz  region 
where  it  decreases  smoothly.  From  120*  to  165*  there  is  a  fairly  constant 
Increase  in  gain  from  2-5  kHz  accompanied  by  a  steady  decrease  at  other 
frequencies.  Shaw  concludes  that  these  regular  changes  in  amplitude  response 
are:  (1)  due  to  the  pinna,  and  (2)  provide  tha  physical  basis  for  (monaural) 
localization.  Further  psychoacoustic  research  is  needed  to  determine  the 
necessary  and  sufficient  acoustic  features  for  monaural  localization. 

Most  lnveotigators  have  reported  HRTF's  averaged  over  a  number  of 
listeners.  These  averaged  HRTF's  show  broad,  common  trends  in  amplitude 
response,  but  the  averaging  process  eliminates  much  of  the  "fine  structure”  of 
individual  HRTF's.  Mehrgardt  and  Mellert  (1977)  shifted  their  HRTF's  along 
the  frequency  dimension  before  averaging  in  order  to  preserve  major  features 
common  to  individual  HRTF's.  Rodgers  (1981)  has  suggested  that  ideosyncratic 
features  of  an  individual's  HRTF  may  be  important  for  localization 
performance. 

A  number  of  studies  show  that  listeners  quickly  adapt  to  a  change  in  the 
ideosyncratic  features  of  the  HRTF.  When  SAL  cues  derived  from  artifical 
pinnae,  or  models  of  someone  else's  pinnae,  are  presented  to  an  observer  via 
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earphones ,  localization  is  initially  less  accurate  than  with  one’s  own 

pinnae.  However,  adaptation  to  the  foreign  pinnae  occurs  rapidly.  After 
several  minutes  of  practice,  most  observers  are  able  to  localize  nearly  as 
accurately  as  with  their  own  pinnae  (cf.,  Searle,  1982;  Batteau,  1967). 

It  also  appears  that  the  anatomical  features  of  some  listeners  provide 
better  localization  cues  than  do  those  of  other  listeners.  Butler  and 
Belendluk  (1977)  reported  that  some  listeners  localize  more  accurately  when 
presented  with  cues  generated  from  another  person's  pinnae  than  with  their 
own. 

The  acoustical  studies  reviewed  above  relate  the  direction  of  a  aound 
source  to  the  proximal  acoustic  stimulus.  While  this  is  necessary  and  useful 
research,  it  is  not  a  sufficient  account  of  auditory  localization 

performance.  Ps yc ho ac oust leal  studies  are  needed  to  relate  the  proximal 
acoustic  stimulus  to  the  listener's  perception  of  the  source  location. 

One  of  the  problems  encountered  in  relating  the  proximal  stimulus  to  the 
percept  in  auditory  localization  has  bean  separating  the  effects  of  the 
various  cues,  such  as  ITD'a,  IAD's,  pinna  cues,  etc.  One  partially  successful 
approach  has  been  to  control  the  situation  under  study  by  using  pure  tones  as 
stimuli.  This  work,  reviewed  previously,  has  helped  to  identify  the  roles  of 
ITD's  and  IAD's.  However,  this  approach  has  not  been  successful  in  Isolating 
pinna,  torso,  and  monaural  head  cues  because  IAD's  still  play  a  role,  even  at 
high  frequencies. 

A  more  productive  approach  has  been  to  study  monaural  localization, 

thereby  eliminating  the  binaural  cues  such  as  ITD  and  IAD.  Butler  and  hia 

colleagues  have  pursued  this  approach  for  horizontal  as  well  as  median” plane 
localization.  Their  results  suggest  that  source  angular  position  is 
frequency-coded,  much  like  Bluuert's  (1969/70)  "critical  band"  hypothesis  for 
median  plane  localization. 

In  a  series  of  studies  (Butler  and  Flannery,  1980;  Flannery  and  Butler, 
1981;  Mus leant  and  Butler,  1984b),  Butler  and  his  co-workers  found  that  the 
perceived  direction  of  narrow-band,  high  frequency  noise  is  related  to  its 
center  frequency  (CF)  rather  than  its  actual  direction.  In  these  experiments, 
the  right  ear  was  acoustically  blocked  and  the  stimuli  were  presented  randomly 
from  various  azimuths  in  the  front-left  quadrant  or  whole  left  side.  A  1  kHz- 
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wide  band-pass  noise  centered  at  4  KHz  was  nearly  always  reported  as  coming 
from  directly  In  front  of  the  listener,  regardless  of  its  actual  position.  As 
the  CF  of  the  nolee  was  Increased,  the  perceived  location  tended  to  move  to 
the  side,  generally  reaching  90s  when  the  C7  reached  about  8  kHz.  As  the  CF 
Increased  beyond  8  kHz,  the  apparent  location  moved  back  directly  in  front  for 
most  listeners,  and  then  migrated  laterally  as  the  CF  waa  increased  further. 
When  the  CF  reached  12  to  13  kHz,  the  apparent  location  again  moved  back  in 
front  of  the  listener.  Not  all  listeners  followed  this  pattern,  and  there 
were  considerable  Individual  differences  among  those  who  did  follow  the 
pattern  as  to  what  CF's  were  associated  with  which  directions.  For  a  few 
listeners,  the  apparent  location  of  the  sound  moved  from  near  0°  azimuth 
through  the  rear  quadrant  to  nearly  180°  as  the  CF  increased  from  4  to  12  kHz. 

Flannery  and  Butler,  and  Musicant  and  Butler,  measured  the  relative  gain 
for  the  free-field  versus  the  entrance  to  subject's  ear  canals  for  1  kHz-wide 
noise  stimuli  with  various  CF's.  They  found  that  the  gain  for  any  given 
stimulus  tended  to  be  greatest  at  that  azimuth  that  listeners  most  often 
perceived  as  the  source  location.  However,  the  amplitude  differences  between 
locations  for  given  stimuli  were  frequently  as  small  as  1  db*  Similar  trends 
can  be  seen  in  the  HRTF's  reported  by  Shaw  (1974b)  and  Mehrgardt  and  Mailer t 
(1972).  In  Shaw's  Figures  6  and  7,  the  HRTF  for  a  4  kHz  tone  shows  a  broad 
peak  at  about  40*  azimuth.  As  the  frequency  increases  to  8  kHz,  the  peak 
migrates  to  90s  or  slightly  greater  azimuth.  A  similar  trend  can  be  seen  in 
Mehrgardt  and  Mellert's  Figure  18,  Thus  there  is  at  least  a  partial 
correspondence  between  the  features  of  averaged  HRTF's  and  the  perceived 
source  location  in  the  horizontal  plane. 

Butler  and  his  colleagues  call  each  frequency  region  which  moves  the 
sound  from  the  front  to  the  lateral  position  a  Spatial  Reference  Map  (SRM) • 
For  example,  for  most  listeners,  4-8  kHz  is  an  SKM,  as  is  8-12  kHz.  Butler 
and  Flannery  and  Flannery  and  Butler  compared  monaural  localization  accuracy 
for  4  kHz-wide  noise  with  various  CF's.  In  one  condition,  the  CF  was  such 
that  the  noise  bandwidth  spanned  two  SRMs,  in  another  condition  the  noise 
bandwidth  fell  entirely  within  an  SRM.  Localization  performance  was  better 
when  the  noise-bandwith  was  centered  over  an  SRM  boundary.  It  is  not  clear 
why  this  should  be  so,  and  the  authors  did  not  offer  a  satisfactory 
explanation. 


42 


Weinrich  (1982)  reported  strong  evidence  for  front-back  discrimination 
cues  around  1.2  kHz  and  from  about  3.5  to  6  kHz.  He  measured  the  HRTF  from 
the  free  field  to  near  the  ear  canal  entrance  for  azimuths  of  30°,  150s,  210°, 

and  330*  for  four  listeners.  He  then  subtracted  the  amplitude  response 

generated  by  a  source  in  back  from  that  generated  by  a  source  in  the 
corresponding  front  position  (e.g.  150*  response  minus  30*  response).  This 
was  done  for  both  the  ipsilateral  and  contralateral  ears.  In  the  ipsilateral 

ear,  he  found  a  broad,  marked  increase  in  SPL  from  about  3.5  to  6  kHz  and  a 

small  decreaso  around  1.2  kHz.  In  the  contralateral  ear,  he  found  a  marked 
decrease  around  1.2  kHz.  Ha  next  addressed  the  question  of  whether  these 
front-back  acoustic  differences  are  actually  used  as  cues  for  front- back 
discrimination.  Using  the  HRT7  features  noted  above,  he  produced  simulated 
front  and  back  cues.  The  cues  were  impressed  on  speech  and  white  noise 
stimuli  and  delivered  via  earphones.  Appropriate  ITD's  were  also  introduced 
to  simulate  front  and  back  positions  at  various  azimuths.  Subjects  were  able 
to  assign  these  synthesized  signals  to  an  array  of  12  directions  with 
relatively  few  front-back  reversals.  However,  the  spectral  features  Impressed 
on  the  signal  had  to  be  exaggerated  in  order  to  achieve  good  front-back 
discrimination  performance.  The  partial  success  achieved  by  Weinrich  suggests 
that  it  may  be  possible  to  produce  accurate  SAL  performance  based  on 
variations  in  the  spectrum  as  a  function  of  apparent  azimuth. 

2.3  Localization  in  the  Median  and  Vertical  Planes 

The  major  binaural  cues,  interaural  time  difference  (ITD)  and  lnteraural 
amplitude  difference  (IAD),  produce  not  only  front-back  but  also  above-below 
localization  ambiguities.  For  the  idealized  spherical  head,  the  ITD  and  IAD 
specify  the  location  of  a  sound  source  to  within  a  "cone  of  confusion,"  shown 
in  Figure  3.  Any  source  location  on  the  cone  produces  the  same  ITD  and  IAD. 
Of  course  real  heads  are  not  spherical,  but  this  does  not  eliminate  the 
ambiguities.  It  simply  makes  the  cone  an  irregular  rather  than  smooth 
surface. 

Localization  in  the  median  sagittal  plane  (MSP)  Is  of  Interest  because 
ITD's  and  IAD's  are  presumably  absent  there.  Many  investigators  have  assumed 
that  localization  In  the  MSP  is  based  solely  on  cues  produced  by  the  pinnae 
and  torso.  In  contrast,  for  judgements  of  source  elevation  In  other  vertical 
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planes,  ITD  and  IAD  cues  should  be  available,  as  well  as  pinna  and  torso 
cues*  One  would  therefore  expect  judgements  in  the  MSP  to  be  considerably 
less  accurate  than  comparable  judgements  in  the  other  vertical  planes. 
Gardner  and  Gardner  (1973)  measured  localization  accuracy  for  connected 
speech*  Loudspeakers  were  arranged  in  a  semicircular  arch  over  the  subject's 
head,  spanning  from  0°  elevation  in  front  of  the  subject  to  180°  in  back.  For 
different  test  conditions,  the  subject  was  rotated  +5",  15°,  45*,  or  90*  with 
respect  to  the  arch  in  order  to  test  localization  in  other  vertical  planes. 
The  results,  shown  in  Figure  13,  show  that  elevation  judgements  are  most 
accurate  for  the  transverse  plane  (90*  to  the  median  plane)  and  intermediate 
for  oblique  planes.  Thesa  result*  make  sense  in  terms  of  the  expected  change 
in  ITD  end  IAD  produced  by  e  givan  change  in  elevation,  Ac.  In  the  median 
plana,  As  produces  zero  change  ITD  and  IAD;  in  the  transverse  plane  the 
change  is  maximized;  in  oblique  planes  the  change  in  ITD /IAD  with  Ac  is 
lntarmsdiats* 

Localization  accuracy  also  varies  considerably  within  the  MSP. 
Wattschuraek  (1973)  measured  the  minimus  audible  angle  for  a  white  noise 
source  at  various  locations  in  the  MSP*  Ha  found  the  greatest  acuity,  about 
4*  in  tha  front  of  the  listener,  Intermediate  acuity  behind,  end  least  acuity 
(about  10*)  ovarhaad.  This  comparts  with  a  maximum  acuity  of  1*  in  the 
frontal  horlsontal  plane. 

2.3.1  Intereurel  Differences 

Studying  locelizetion  within  the  MSP  would  seem  to  be  a  convenient  way  of 
eliminating  ITD  and  IAD,  therefore  enabling  tha  investigator  to  study  torso 
and  pinna  cues  in  isolation.  However,  tha  assumption  that  interaural 
differences  play  no  role  in  MSP  localisation  has  been  questioned. 

As  shown  in  Table  1,  Soarle  et  el.  (1976)  propose  a  third  interaural  cue 
that  provides  information  about  the  elevation  as  well  as  the  azimuth  of  the 
sound  source.  This  third  cue  la  the  interaural  pinna  amplitude  difference. 
Asymmetries  in  the  left  end  right  pinnae  (and  head  asymmetries)  could  produce 
interaural  amplitude  differences  as  a  function  of  source  elevation  in  the  MSP. 

Searle,  Braids,  Cuddy  &  Davie  (1975)  measured  the  amplitude  response  of 
tue  head  and  pinnae  at  the  ear  canal  entrance  as  a  function  of  frequency.  The 
measurements  were  taken  on  three  human  subjects,  for  sources  at  various 
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Figure  13.  Localization  of  connected- 
speech  signals  as  function 
of  transverse  to  median 
plane  orientation.  (From 
Gardner  and  Gardner,  1973.) 
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elevations  in  the  MSP*  A a  expected,  there  was  a  common  change  In  amplitude 
response  over  frequency  for  both  ears.  These  are  the  monaural  pinna  cues*  In 
additlont  Searle  et«  al.  found  a  disparity  between  the  left  and  right  ear 
responses  which  changed  systematically  with  elevation  angle.  These 
differences  are  potential  lnteraural  cues  to  localization  in  the  MSP. 

Searle  et  al.  (1975)  argue  that  the  lnteraural  pinnae  differences  are 
used  as  cues  for  MSP  localization.  They  review  previous  research  which  shows 
that  monaural  localization  in  the  MSP  is  less  accurate  than  binaural 
localization  in  the  MSP  (cf.,  Butler,  1969a).  This  finding  could  be  due  to 
either  the  absence  of  binaural  disparity  cues,  or  to  the  loss  of  redundant 
information  from  one  ear  in  the  monaural  condition. 

As  a  test  of  these  two  alternate  explanations,  Searle  et  al.  (1975) 
presented  simulated  localization  cuss  for  sources  in  the  MSP  via  earphones  to 
four  subjects.  The  localization  cues  were  obtained  by  first  recording  from 
the  left  and  right  ear  canals  of  each  subject  for  sources  in  the  MSP.  The 
recordings  wore  then  played  back  either  dichotlcally  (left  channel  to  left  ear 
and  right  channel  to  right  ear)  or  diotically  (left  or  right  channel  to  both 
ears).  Searle  et  al.  also  varied  the  amount  of  a  priori  information  about  the 
signal  available  to  the  subjects  by  varying  the  spectrum  of  the  signal  from 
trial  to  trial  in  soma  test  conditions.  As  noted  in  Section  2.2.3,  the 
listener  must  be  familiar  with  the  source  spectrum  in  order  to  use  monaural 
cues  effectively.  They  reasoned  that  reducing  the  amount  of  £  priori 
information  available  should  affect  localisation  accuracy  less  in  the  dlchotlc 
than  in  the  dlotic  condition  if  binaural  disparities  were  used  as  cues  in  the 
former*  Their  results  show  better  localization  in  the  dlchotlc  conditions 
overall  and  greater  degradation  of  localization  accuracy  In  the  dlotic 
condition  when  the  source  spectrum  was  variad  from  trial  to  trial. 

The  Saarle  at  al.  (1975)  results  suggsst  that  binaural  disparity  cues  are 
used  in  MSP  localization,  but  they  are  not  conclusive.  In  the  dlotic 
presentation  condition,  one  of  the  ears  recsived  simulated  cues  generated  by 
recording  from  the  opposite  ear.  In  the  binaural  condition,  both  ears 
received  cues  generated  by  recording  from  themselves.  It  is  possible  that  the 
combination  of  unnatural  cues  in  one  ear  plus  the  experimental  manipulation  of 
trial-to-trial  spectral  variations  in  the  source  produce  an  interactive  effect 
on  »onaural  localization  accuracy.  For  example,  if  Saarle  et  al.  had  also 
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tested  their  subjects  monaurally,  one  might  find  that  varying  the  source 
spectrum  produces  greater  degradation  when  the  cues  were  recorded  from  another 
ear,  than  when  the  cues  were  recorded  from  the  ear  being  stimulated. 

On  the  other  hand,  Hebrank  and  Wright  (1974a)  argue  chat  binaural  cues  do 
not  play  a  role  in  MSP  localization.  In  a  first  experiment,  they  presented 
white  noise  and  "rippled-spectrum"  nolee  to  monaural  and  binaural  subjects, 
with  the  source  in  the  MSP.  The  spectrum  of  the  rippled  noise  contained  peaks 
and  notches  similar  to  those  Introduced  by  the  pinna.  The  location  of  these 
peaks  and  notches  was  varied  randomly  from  trial  to  trial.  The  rippled  noise 
presumably  denied  to  the  subjects  _a  priori  Information  about  the  signal 
spectrum,  and  should  therefore  degrade  localisation  based  on  monaural  cues. 
Habrank  and  Wright  found  that  the  rippled  noise  (versus  white  noise)  produced 
the  same  amount  of  degradation  in  localisation  performance  for  monaural  and 
binaural  subjects.  This  result  suggests  that  binaural  information,  even  if 
present  in  MSP  localization,  does  not  aid  localisation.  In  a  second 
experiment,  Hebrank  and  Wright  found  that,  if  given  localization  feedback, 
monaural  subjects  quickly  learned  to  localise  in  the  MSP  as  accurately  as  do 
binaural  subjects. 

Even  though  binaural  differences  may  play  no  direct  role  in  MSP 
localization,  they  are  responsible  for  centering  the  acoustic  image  in  the 
MSP.  As  Blauert  (1982)  points  out,  binaural  information  la  not  absent  in  MSP 
localization.  Rather,  binaural  differences  are  at  that  value  which  Indicates 
that  the  source  is  located  in  the  MSP.  Gardner  (1973)  also  points  out  that 
binaural  information  performs  a  "centralizing"  function  in  MSP  localization. 
When  one  ear  la  blocked,  the  acoustic  image  rotates  laterally  toward  the  open 
ear.  In  this  condition,  judging  the  position  of  the  source  in  the  MSP  becomes 
very  difficult. 

2.3.2  Monaural  Cues 

The  two  dominant  theories  of  monaural  localization  are  the  theory  of 
Timbre  Differences  (a  frequency  domain  Approach)  and  the  Echo-Delay  position 
(a  time  domain  approach).  The  latter  position  was  argued  by  Batteau  (1967)  to 
explain  how  the  pinnae  produce  direction-dependent  cues.  Specifically,  he 
proposed  that  reflections  from  the  various  folds  and  cavities  of  the  pinnae 
produce  reflected,  and  therefore  delayed,  signals.  The  relative  amount  of 
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each  delay  I9  postulated  to  depend  on  the  location  of  the  sound  source 
vertically  and  horizontally  relative  to  the  pinna.  Batteau  further  proposed 
that  tha  ear  and  brain  determine  the  sound  source  location  by  measuring  the 
amount  by  which  the  echoes  are  delayed  from  the  mein  signal. 

The  neurological  delay-processing  model  proposed  by  Batteau  has  been 
largely  discarded  due  to  claims  that  tha  temporal  resolution  of  the  auditory 
system  is  not  adequate.  The  minimum  audible  angle  in  the  median  plane  (where 
processing  is  based  only  on  monaural  cues)  is  about  4*  (Harris  &  Surgeant, 
1971;  Wattechureck,  1973).  In  ordar  to  discriminate  that  smell  a  change  in 
source  location,  the  temporal  acuity  of  the  ear  and  brain  would  have  to  be 
lees  then  5  Msec  (Batteau,  1967).  Green  (1971)  and  Zwlcker  (1973)  report  that 
tha  minimum  dlscrlmlnabla  monaural  time  difference  is  greeter  than  1  msec. 
However^  Hebrenk  end  Wright  (1974a)  found  that  tha  just-notlcaabla-difference 
(Jnd)  for  time  delays  was  in  the  region  from  3  to  7  Uaac.  They  used  time- 
delayed  white  noise  summed  with  itself  end  presented  monaurally.  Hebrenk  and 
Wright  conclude  that  the  human  auditory  system  has  the  required  temporal 
acuity,  but  reject  tha  neurological  delay-processing  model  on  other  grounds. 
They  bees  their  rejection  on  e  report  by  Thurlow  and  Rungs  (1967),  who  found 
that  clicks  were  localised  more  poorly  then  wee  white  noise.  If  the  auditory 
system  determined  location  by  Judging  delays,  stimuli  with  stsep  rise  times, 
such  as  clicks,  should  bs  localissd  mors  sccurstsly  than  the  relatively  slowly 
varying  envelope  of  e  noise  function* 

The  spectral  approach  to  monaural  localisation  holds  that  reflections, 
resonances,  end  diffraction  of  the  sound  sourcs  by  the  pinna  end  torso  produce 
direction-dependent  changes  In  the  spectrum  of  the  sound  reaching  the 
eardrum.  The  auditory  system  is  presumed  to  Judge  the  source  location 
according  to  tha  apactral  features  of  the-  sound  reaching  the  eardrum.  As  in 
the  case  of  monaural  localisation  in  ths  horizontal  plans,  jb  priori 
information  about  ths  sourcs  spectrum  is  required  in  order  to  judge  its 
location.  Otherwise,  the  auditory  system  has  no  way  of  discerning  whether  a 
given  spectral  feature  is  e  function  of  the  source  location  or  a  property  of 
the  original  source  spectrum.  The  spectral  theory  leads  naturally  to  the 
question  of  whet  spectral  features  are  produced  by  the  pinnae  and  torso  and 
used  by  the  brain* 
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A  number  of  Investigators  have  studied  the  frequency  range  over  which  the 
pinnae  and  torso  produce  useful  cues  for  MSP  localization.  Gardner  (1973) 
found  that  localisation  of  full-band  noise  in  the  anterior  portion  of  the  MSP 
was  degraded  when  the  pinnae  were  occluded*  He  also  coopered  localization 
accuracy  for  one-half  octave  bandwidth  noise  centered  at  2,  3,  4,  6,  8,  or  10 
kHz,  with  and  without  the  pinnae  occluded.  With  open  pinnae,  localization 
accuracy  increased  as  the  center  frequency  of  the  noise-hand  increased;  and 
was  best  for  full-band  noise.  When  both  pinnae  were  occluded,  localization 
accuracy  dropped  to  near  chance  for  the  4,  6,  8,  and  10  kHz  noiee  bands,  and 
to  just  above  chance  for  the  2  and  3  kHz  noise  bands  and  the  full-band 
noise.  These  results  suggest  that  the  pinnae  oroduce  useful  cues  from  3  or 
4  KHz  to  at  least  10  kHz,  end  that  soma  other  factor  (presumably  the  torso) 
produces  cues  in  the  2-3  kHz  region.  Gardner  and  Gardner  (1973)  report 
comparable  results  for  the  poaterlor  portion  of  the  MSP. 

Hebrank  and  Wright  (1974b)  measured  localization  accuracy  for  white  noise 
and  sharply  filtered  high-  and  low-pass  noises.  Source  position  was  varied 
among  nine  locations  from  -30  to  +210*  in  the  MSP.  Their  results,  presented 
In  Figure  14,  show  that  the  absence  of  spectral  energy  below  about  3.8  kHz  and 
above  16  kHz  does  not  affect  M8P  localization  performance.  These  results  are 
not  in  conflict  with  Gardner's  results  showing  limited  ability  to  localize 
sound  in  the  2-3  kHz  region.  Note  in  Figure  14  that  localization  accuracy  is 
still  slightly  above  chance  for  the  4  kHz  low-paaa  noise. 

The  most  compelling  evidence  for  the  spectral  theory  of  monaural 

localization  is  the  auditory  illusion  produced  by  narrow-band  signals  in  the 

MSP.  The  perceived  location  of  such  sources  is  related  to  their  center 
frequency  rather  than  their  actual  location.  Roffler  and  Butler  (1968) 
presented  tone  bursts  of  frequencies  from  250  Hz  to  7200  Hz  at  locations  from 
-13°  to  +20°  elevation  in  the  MSP.  They  found  that  reported  source  height  on 
a  54  inch  high  panel  in  front  of  the  listener  wee  monotonically  related  to 
stimulus  frequency.  They  also  report  several  experiments  which  show  that  this 
relationship  is  not  due  to  learned  associations,  e.g.,  the  convention  of 
labeling  tones  of  greater  frequency  "higher". 

Blauert  (1969/70)  presented  1 /3-octave  band-pass  noise  to  binaural 

listeners  from  loudspeakers  located  in  front,  above,  or  behind  the  subject. 

The  listener  reported  the  apparent  direction  of  the  sound  source  (in  front, 
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Figure  14.  Mean  percent  correct  reeponeee  for 
ten  eubjecte  localising  low-  and 
hlgh-paaa-flltered  white  nolae  with 
accuracies  of  +45*  (0)  and  +13*  (+) . 
Dotted  lines  show  mean  accuracies 
for  white  noise.  (From  Hebrank 
and  Wright,  1974b.) 
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behind,  or  above).  The  center  frequency k  duration  and  intensity  of  the 
stimuli  ware  varied  randomly  from  trial  to  trial.  Reported  source  direction 
was  related  to  signal  frequency,  and  not  to  signal  duration,  location,  or 
Intensity.  Specific  frequency  bands  were  associated  with  each  direction,  as 
shown  in  Figure  IS. 

Butler  and  Helwig  (1983)  report  results  strikingly  similar  to  those 
obtained  by  Blauert  at  frequencies  of  4  kH*  and  greater.  They  presented  1.0 
kHz-svide  band-pass  noise  bursts  at  center  frequencies  from  4  to  14  kHs.  The 
subject  was  seated  under  an  arched  array  of  loudspeakers  which  spanned  from 
-30°  to  +210°  elevation  in  tha  MSP.  Reported  source  location  was  related  to 
center  frequency  and  not  to  actual  source  location.  For  four  out  of  five 
listeners,  reported  source  location  increased  nearly  linearly  /torn  -15*  to 
about  4-180°  elevation  as  tha  center  frequency  increased  from  4  kHz  to  11  or  12 
kHz.  For  one  listener,  reported  source  elevation  increased  to  roughly 
overhead  (90*  to  105  )  at  8  kHz,  and  then  abruptly  ravartad  to  the  frontal 
position  at  9  kHz.  For  four  of  tha  five  listeners,  the  spatial  correlates  of 
frequencies  above  4  kHz  in  Butler  and  Halvig's  study  corresponded  vary  clocsly 
to  thoaa  in  Blauert*  s  study.  Hebrank  and  Wright  (1974b)  also  report  apparent 
source  positions  for  filtered  noise  which  ere  consistent  with  the  findings  of 
Butler  and  Helwig,  and  Blauert. 

Given  such  consistent  psychoacoustic  data,  one  would  expect  to  find 
reliable  corresponding  spectral  features.  He  ,,  ink  And  Wright  (1974b)  made 
models  of  three  subjects  ears  and  measured  their  amplitude  response  (HRTF) 
from  4  to  16  kHz  for  sources  at  various  elevations  in  the  MSP.  They  found 
three  features  of  the  HRTF's  that  were  common  among  the  three  ears,  and  which 
corresponded  to  the  perceived  locations  of  flltarad-nolse  stimuli:  (1)  a 

notch,  the  low-frequency  side  of  which  migrated  from  about  5  to  8  kHz  as  the 
source  elevation  Increased  from  -30*  to  +60°,  (2)  a  peak  between  7-9  kHz  for 
overhead  source  locations,  and  (3)  greater  energy  above  13  kHz  for  frontal 
source  locations  than  for  bahind  locations.  Butler  and  Balendiuk  (1977) 
measured  the  amplitude  response  from  4  to  9  kHz  for  8  subjects'  ears.  They 
reported  a  prominent  notch,  tha  center  of  which  moved  from  about  5.5  kHz  to 
about  7  kHz  as  the  source  elevation  changed  from  -30°  to  +30°.  The  same 
features  of  the  pinna  amplitude  response  have  also  been  reported  by  Shaw  and 
Teranlshi  (1968)  for  sources  at  elevations  of  -45*  to  +45°  in  the  transverse 
vertical  plane. 
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Figure  15.  Relative  frequencies  with  which  listeners  reported  one-third  octave 
-oise  of  various  center  frequencies  to  be  located  behind,  above,  and 
in  front.  (From  Blauert,  1969/70.) 
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A  number  of  Investigators  have  proposed  chat  elevation  In  the  MSP  is 
encoded  by  a  combination  of  spectral  features.  Wright,  Hebrank,  and  Wilson 
(1974)  point  out  that  the  spectral  features  used  by  the  auditory  system  In 
monaural  localisation  could  be  generated  by  pinna  reflections  added  to  the 
original  signal.  A  broad-band  signal  whan  summed  with  a  delay  of  itself  at  an 
appropriate  amplitude  ratio,  a,  produces  a  sound,  the  spectrin  of  which 
contains  multiple  peaks  and  notches  as  shown  in  Figure  16.  The  delays 
proposed  by  Batteau  (1967)  (10-80  Msec  for  asimuth  angles  and  100-300  Msec  for 
elevation  angles)  product  ripples  in  roughly  ths  appropriate  parts  of  the 
spectrum.  Watkins  (1978)  shows  that  the  spectral  cues  which  produce  the 
perception  of  particular  source  directions  can  be  deacrlbed  as  components  of  a 
"comb-filter"  pattern  which  can  be  produced  by  echo  delays  In  the  range  of 
values  suggested  by  Batteau.  Figure  17  shows  the  effect  of  adding  two  delayed 
components  of  a  white  noise  signal  to  itself.  Note  that  a  given  delay, 
produces  a  spectrum  with  multiple  peaks  and  notches.  The  frequencies  of  the 
peaks  and  notchss  migrate  as  the  amount  of  delay,  *v,  of  one  of  the  added 
signals  changes. 

Watkins  conducted  an  experiment  in  which  subjects  reported  the  last 

perceived  location  of  an  apparently  moving  source.  Apparent  movement  was 
created  by  varying  one  of  the  delays,  in  a  two-dslay-and-add  signal*  The 
"above"  and  "below"  labels  in  Figure  17  are  positioned  above  the  values 
which  were  perceived  as  above  (+40*)  sod  below  (-20*)  In  the  transverse 
vertical  plane. 

Note  that  the  spectral  features  shown  in  Figure  17  correspond  roughly  to 
those  reported  by  Hebrank  and  Wright  (1974b),  and  Butler  and  Belandluk  (1977), 
for  source  elevations  in  the  MS?.  Specifically:  (1)  there  is  a  notch,  the 
low-frequency  side  of  which  migrates  from  about  5  to  8  kHs  as  the  source 
elevation  increases,  and  (2)  there  is  a  peak  between  7-9  kHs  for  overhead 
(above)  locations.  Watkins  concludes  that:  (1)  the  same  monaural  mechanism 
contributes  to  vertical  localisation  in  both  the  transverse  and  median  planes, 

and  (2)  the  decoding  mechanism  for  vertical  location  is  based  on  the 

recognition  of  spectral  patterns  like  those  produced  by  a  multiple  delay-and- 

add  system. 


53 


mcauCNCv  cumd 


Figure  16.  Spectral  filtering  created  by  combining 
a  signal  with  a  delay  of  itself.  Delays 
less  than  30  psec  cause  low-pass  filtering 
and  delays  greater  than  30  /isec  cause 
multiple  spectral  notches.  (From  Wright, 
Hebrank,  and  Wilson,  1974.) 


2.4  Auditory  Pittance  Perception 

The  acoustical  characteristics  of  a  sound  field  Chat  vary  as  a  function 
of  the  distance  of  the  source  have  been  well  documented*  However,  the 

effectiveness  of  these  acoustical  parameters  as  psychophysical  cues  for  the 

perception  of  distance  has  not  been  thoroughly  studied.  A  distinction  must  be 
made  between  absolute  judgements  of  distance  and  relative  judgements  of 
distance!  absolute  Judgements  can  be  made  based  exclusively  on  the 

characteristics  of  the  sound  field,  whereas  relative  judgements  require  that 
the  listener  have,  or  aaaume,  a  priori  information  such  as  a  reference  sound 
or  repeated  exposure  to  the  sound  source*  Monaural  cues  (such  as  the 

Intensity  and  spectral  composition  of  the  sound  at  the  eardrum)  can  provide 
only  relative  Information,  in  principle,  binaural  cues  can  provide  absolute 
information  only  if  the  direction  of  the  source  is  known  or  assumed,  or  if  the 
sound  Is  repeatedly  sampled  as  the  head  is  rotated  and  the  source  Is  known  to 
be  stationary.  There  Is  considerable  evidence  that  monaural  cues  play  an 
important  role  In  relative  distance  judgements*  Thera  14  also  evidence  that 
binaural  information  plays  a  role  In  distance  judgement  when  the  head  is 
stationary.  However,  It  Is  not  clear  what  binaural  cues  ars  used* 

The  physical  characteristics  of  a  sound  field  vary  in  terms  of  both 
Intensity  and  frequency  composition  as  a  function  of  distance.  According  to 
the  1/E2  loss  law,  Intensity  decreases  as  distance  Increases*  Amplitude 
decreases  by  6  db  for  each  doubling  of  distance*  Coleman  (1963),  in  his 
review  of  auditory  depth  cues,  cites  various  psychophysical  data  which  support 
the  use  of  the  1/R2  loss  law  as  a  relative  cue  for  distance  judgements. 

The  usefulness  of  intensity  as  a  cue  to  distance  seems  to  depend  on  (l) 
the  distance  of  the  source  from  the  listener,  (2)  the  characteristics  of  the 
sound,  and  (3)  the  listener's  degree  of  familiarity  with  the  sound.  Bakes y 
(1949)  showed  that  the  perception  of  distance  tends  to  level  off  as  distance 
increases  past  a  critical  point.  This  outer  limit  on  distance  perception  of 
auditory  events  has  been  termed  the  auditory  horlson.  Gardner  (1969b)  showed 
that  the  characteristics  of  the  input  (t.e.,  whispered  vs.  shouted  speech) 
affect  the  distance  estimate,  even  though  amplitude  loss  with  distance  is  the 
seme  for  both  inputs*  The  results  indicated  that  the  estimated  dlstence  of 
shouted  speech  tends  to  be  greater  than  that  for  whispered  speech.  Coleman 
/ l 962)  found  that  distance  judgements  were  unrelated  to  actual  distance  on  the 
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first  trial  of  a  repsatedrmeasures  experiment.  However ,  with  repeated 
exposure,  accuracy  Increased.  This  result  Indicates  the  Importance  of 
experience  In  judging  distance. 

The  frequency  spectra  of  a  complex  sound  field  has  also  been  Implicated 
In  providing  usable  cues  to  distance.  Changes  In  the  frequency  spectra  occur 
as  a  function  of  the  distance  of  the  source  from  the  observer.  At  short 
distances  «  4  feet)  the  sound  field  Is  relatively  spherical  In  shape.  Bakeay 
(1960)  reported  that  a  sound  source  appears  to  approach  the  observer  if  the 
low  frequency  components  of  the  source  are  increased,  relative  to  the  high 
frequency  Information. 

At  farther  dlatances  the  wave  front  becomes  planar  In  shape  (l.e»,  far- 
field  conditions  obtain).  In  the  far-field,  high  frequencies  aro  attenuated 
more  rapidly  than  are  lower  frequencies  as  source  distance  Increases.  The 
difference  in  the  rate  of  attenuation  of  high  varus  low  frequencies  depends  on 
humidity,  temperature,  terrain  features,  and  inhomogenelties  in  the 
atmosphere.  These  atmospheric  effects  are  in  addition  to  the  1/R2  loss. 

The  psychophysical  data  suggest,  at  least  in  monaural  situations,  that 
frequency  composition  of  a  sound  serves  as  a  relative  cue  for  judgements  of 
distance.  Coleman  (1968)  showed  that  observers  were  accurate  at  localising 
the  distance  of  a  source,  located  8  to  24  feet  away,  based  upon  spectral 
cues.  Loundsbury  and  Butler's  (1979)  investigation  of  spectral  cues  Indicated 
that  some  observers  were  quite  accurate  at  locating  distances.  However,  19% 
of  their  observers  consistently  inverted  their  estimates  of  distance. 

Binaural  cues  to  distance  have  also  been  investigated.  The  potential 

cues  Include  lnteraural  time  differences  (ITD's),  lnteraural  amplitude 
differences  (lAD's),  and  lnteraural  spectral  differences  (ISO's).  Holt  and 
Thurlow  (1969)  demonstrated  that  when  lnteraural  cues  are  available,  observers 
can  accurately  rank-order  the  distances  of  the  sources.  When  the  source 
direction  waa  to  the  side  of  the  head,  observers  could  accurately  judge 

distance;  but  when  the  source  was  in  front  of  the  head,  they  could  not. 

Levy  and  Butler  (1978)  directly  manipulated  ITD,  IAD  and  ISD  cues.  They 

recorded  broad-band,  low-paas,  and  high-pass  noise  bursts  in  stereo  from  a 
live  model's  head  with  the  source  at  a  distance  of  5  feet.  Three  separate 
recordings  were  made  which  contained  (1)  ITD's,  IAD's  and  ISO's,  (2)  only 
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ITD's  and  ISO's,  or  (3)  only  ITD's.  The  processed  stimuli  were  then  presented 
dichotlcally  to  a  listener  who  judged  their  apparent  distance.  Judged 
distance  was  related  to  the  frequency  composition  of  the  stimuli.  Elimination 
of  lower  frequencies  (and  thus  the  fine-structure  ITD  cue)  resulted  in  serious 
Underestimation  of  the  source  distance.  Distance  estimates  did  not  change 
significantly  when  I AD' 9  and  ISO's  were  eliminated.  The  results  suggest  that 
ITD's  are  the  major  binaural  cue  for ,  distance  perception.  In  a  second 
experiment ,  they  separated  ongoing  (fine  structure)  ITD’s  from  onset 
(envelope)  ITD's.  Elimination  of  onset  ITD's  resulted  in  significantly 
greater  underestimation  of  the  source  distance.  Apparently  both  fine 
structure  end  envelope  lTD'i  play  a  role  in  distance  perception. 

The  effects  of  head  movements!  to  create  changes  in  IAD's,  has  been 
suggested  as  a  binaural  cue  for  distance.  The  approach  employed  has  been  to 
mathematically  model  the  Information  required  to  derive  distance.  Hirsh 
(1968)  attempted  to  model  auditory  depth  perception  based  solely  on  lnteraural 
time  and  intensity  differences*  Unfortunately,  without  the  repeated  sampling 
available  with  head  movements,  Hirsh's  model  resulted  in  correlated  equations, 
whereby  IAD  and  ITD  could  not  be  independently  estimated  (Mollno,  1973). 
Lambert  (1974)  has  shown  that  listeners  could  Judge  distance  based  on  the  rate 
of  change  of  the  IAD  as  the  head  Is  rotated  In  the  horlsontal  plane.  However, 
no  psychophysical  evidence  Is  yet  available  to  support  the  usefulness  of  this 
cue  or  other  head-movement  related  cues  in  judging  distance. 

2.3  Effects  of  Head  Movement  and  Vision  on  Auditory  Localisation 

Four  major  explanations  have  been  advanced  to  Account  for  the  finding 
that  head  movements  and  the  availability  of  visual  inputs  make  auditory 
localisation  Judgments  more  accurate*  The  four  explanations  should  be 
regarded  as  different  aspects  of  the  same  general  problem,  rather  than  as 
mutually  exdualve  hypothaeea.  Thay  include!  (1)  the  cue-modulation  position 
(Wallach,  1940),  (2)  tha  central  auditory  acuity  hypothesis  (Pollack  and  Rose, 
1967),  (3)  tha  visual  frame-of-raferanca  theory  (Warren,  1970;  Platt  and 
Warren,  1972),  and  (4)  tha  motor  feadback/spattal  memory  position  (Jones, 
1975;  Jones  and  Kabanoff,  1975)* 

Tha  first  explanation,  advanced  by  Wallach  (1940),  is  that  moving  tho 
'lead,  and  hanca  tha  ears,  modulates  tha  binaural  acoustic  cues,  i.e. , 
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interaural  time  and  amplitude  difference*  (ITD  and  IAD).  Wallach  noted  that 
the  change  in  these  binaural  cues  with  head  movement  depends  on  the  initial 
direction  of  the  sound  source  relative  to  the  heed.  As  the  head  is  rotated 
counter-clockwise  in  the  horisontal  plane,  the  ITD  and  IAD  increase  for 
sources  on  the  right-front  and  left-rear  sides  of  the  head,  but  decrease  for 
sources  in  the  other  two  quadrants*  Pivoting  the  head  from  side  to  side 
produces  corresponding  effects  in  the  vertical  plane.  Wallach  suggested  that, 
such  movements  are  especially  helpful  in  resolving  sound  sources  in  front  from 
those  in  back  and  those  above  from  those  below. 

1  •  ■  i 

Lambert  (1974)  has  developed  a  mathematical  model  which  describes  how  the 
ITD  cue  changes  as  a  function  of  head  rotation  for  an  idealised  observer*  The 
head  is  modeled  as  a  sphere,  with  two  holes  for  ears  located  at  +,  90* 
azimuth.  Assuming  that  the  sound  is  of  low  enough  frequency  so  that  phase 
ambiguities  do  not  occur,  Lambert's  model  shows  that  the  observer  can  uniquely 
determine  the  source  asimuth  based  solely  on  the  modulation  of  the  ITD  with 
head  rotation. 

Other  acoustic  cues  also  change  as  the  head  is  rotated  or  pivoted*  The 
change  in  IAD  as  a  function  of  the  asimuth  of  the  source  relative  to  the  head 
was  described  in  Section  2*2.2*  The  role  of  IAD  in  vertical  localisation  is 
covered  In  Section  2.3  A a  noted  in  those  sections,  the  IAD  is  sizeable  only 
at  medium  and  high  frequencies*  Studies  of  auditory  latarallzation  using 
dichotic  presentation  have  amply  demonstrated  that  variation  of  the  IaD  causes 
the  percstvsd  location  of  the  sound  to  shift  laterally  (cf.,  Mills,  1972; 
Durlach  and  Colburn,  1978), 

Head  rotation  and  pivoting  should  also  modulate  monaural  (chit fly  pinna- 
produced)  cuts.  Under  normal  listening  conditions.  It  Is  natural  to  reorient 
one's  head  toward  a  newly  heard  sound  source.  Prsadman  and  Fisher  (1968)  have 
suggested  that  monaural  cues  help  direct  the  initial  orienting  response  (heed 
movement)  to  a  sound,  but  do  not  sarve  as  important  cuss  during  hand 
movement.  They  measured  localisation  accuracy  in  the  horizontal  plana  with 
listeners  using  their  own  pinnae,  no  (t.e.,  occluded)  pinnae,  or  artiflcai 
pinnaa,  with  or  without  head  movement.  With  no  head  movement,  their  own  or 
artiflcai  pinnaa  Increased  localisation  accuracy  considerably  relative  to  the 
no-pinnae  condition.  However,  whan  head  movement  was  allowed,  there  wee  no 
significant  difference  between  the  pinnae  and  no-pinnae  conditions.  This 
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result  suggests  that  the  modulation  of  binaural  cues  during  head  movement  is 
so  powerful  that  It  swamps  out  any  effect  of  monaural  cue  modulation. 
Unfortunately,  Freedman  and  Fisher  did  not  measure  the  time  required  to 
localize.  If  their  hypothesis  that  pinnae  cues  help  direct  the  initial 
orienting  response  is  correct,  then  localization  time  should  be  shorter  when 
pinnae  cues  are  available. 

A  second  hypothesis  has  been  advanced  by  Pollack  and  Rose  (1967).  They 
found  that  head  movement  facilitated  auditory  localization  only  when  (1)  the 
sound  source  is  initially  located  toward  the  side  of  the  head,  and  (2)  the 
duration  of  the  stimulus  exceeds  the  time  required  to  reorient  the  head  toward 
th.2  source.  They  draw  an  analogy  between  foveal  visual  acuity  and  auditory 
acuity  in  the  region  tround  0°  azimuth,  and  suggest  that  head  movement  allows 
the  listener  to  take  advantage  of  the  higher-acuity  region. 

A  third  explanation  has  been  called  the  visual  frame-of-reference 
hypothesis  (Warren,  1970;  Platt  &  Warren,  1972;  Shelton  &  Searle,  1980).  In 
the  original  formulation  of  this  position,  Warren  (1970)  suggested  that,  for 
adults,  vision  is  the  primary  means  of  organizing  sensory  space,  and  that  the 
necessary  condition  for  facilitation  of  auditory  localization  is  structured 
visual  input.  In  a  later  article,  Platt  and  Warren  (1972)  modified  the 
original  position  to  stress  the  Interaction  of  eye  movements  and  visual  (i.e., 
retinal)  information.  They  reported  that  eye  movements  in  a  lighted,  textured 
environment  facilitate  localization  relative  to  a  condition  in  which  the  eyes 
are  fixated  in  a  lighted  environment.  However,  eye  movements  in  a  dark 
environment  did  not  produce  a  facilitation  relative  to  the  same  control 
condition.  Similar  results  have  been  reported  by  Mastroianni  (1983b). 

The  facilitative  effect  of  vision  in  the  Platt  and  Warren  (1972)  study 
should  be  distinguished  from  the  intersensory  bias  or  "visual-capture" 
phenomenon.  The  latter  phenomenon  is  the  tendency  for  a  sighted  listener  to 
perceive  the  origin  of  a  sound  to  be  a  plausible  visual  object.  In  the  Platt 
and  Warren  (1972)  study,  the  visual  background  was  a  burlap  curtain,  i.e., 
there  were  no  visual  objects  which  could  have  been  interpreted  as  the  sound 
source.  Pick,  Warren,  and  Hay  (1969)  reported  an  experiment  in  which  subjects 
pointed  at  (1)  the  heard  position  of  a  sound  with  the  loudspeaker  viewed 
through  a  prism  which  displaced  the  visual  image,  or  (2)  the  seen  position  of 
the  loudspeaker  emitting  sounds.  The  heard  position  of  the  sound  was  strongly 
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biased  in  the  direction  of  the  displaced  visual  image,  but  the  seen  position 
was  not  biased  by  the  sound. 

A  fourth  explanation  for  the  facilitative  effect  of  vision  on  auditory 
localization  has  been  advanced  by  Jones  (1975)  and  Jones  and  Kabanoff 
(1975).  They  contend  that  motor  activity  alone  provides  the  spatial 
framework,  and  that  all  sensory  stimulation  maps  onto  this  spatial 
framework.  These  investigators  suggest  that  eye  movements  in  the  direction  of 
the  auditory  stimulus  facilitate  localization  by  "stabilizing"  or  updating 
spatial  memory.  Jones  and  Kabanoff  explain  Warren’s  (1970)  and  Platt  and 
Warren's  (1972)  findings  that  eye  movements  facilitate  auditory  localization 
only  in  a  lighted,  textured  environment  by  noting  that  eye  movements  are  more 
accurate  in  the  light.  They  propose  that  the  role  of  retinal  information  is 
to  reduce  eye  drift  rather  than  to  provide  the  primary  spatial  point  of 
reference.  Jones  and  Kabanoff  (1975)  report  an  experiment  in  -which  eye 
movements  in  the  direction  of  an  auditory  stimulus  facilitated  localization, 
whereas  eye  movements  in  the  opposite  direction  produced  less  accurate 
localization  than  a  control  condition  in  which  the  eyes  were  fixated.  In  that 
experiment,  the  measure  of  localization  accuracy  was  the  observer's  percentage 
of  correct  judgements  as  to  whether  the  auditory  stimulus  was  3°  to  the  right 
or  3°  to  the  left  of  the  median  plane  of  his  or  her  body.  The  speakers  were 
hidden  from  view  in  order  to  rule  out  visual  capture  effects.  The  finding 
that  eye  movements  in  the  direction  opposite  the  sound  degraded  localization 
is  contrary  to  the  visual-frame-of-reference  position.  However,  this  finding 
does  not  rule  out  the  possibility  that  visual  (retinal)  information  may  play 
some  role  in  conjunction  with  eye  movements  to  facilitate  auditory 
localization. 

Shelton,  Rodgers,  and  Searle  (1982)  note  that  the  visual-frame-of- 
reference  and  the  spatial  memory  positions  are  not  mutually  exclusive.  Even 
though  the  eyes  are  constantly  moving,  one  perceives  stationary  objects  in  the 
environment  as  remaining  stationary.  The  fact  that  the  image  of  an  object 
moves  across  the  retina  does  not  necessarily  lead  to  the  perception  that  the 
object  is  moving.  The  visual  system  must  somehow  integrate  retinal 
information  and  extra-retinal  eye-position  information  (EEPI)  in  order  to 
determine  whether  the  object  or  the  eye  is  moving  (Matin,  Stevens  and  Picoult, 
1983).  Given  that  retinal  information  and  EEPI  are  used  in  visual 
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localisation,  it  seems  plausible  that  they  both  also  play  a  role  In  auditory 
localization. 

It  is  also  clear  that  the  modulation  of  acoustic  cues  with  head  motion 
relays  a  role  in  auditory  localisation  independently  of  vision.  Thurlow  and 
Xunge  (196?)  found  that  head  movements  facilitated  auditory  localisation  even 
when  subjects  whet*  blindfolded.  Wallach  (1940)  reports  three  conditions  in 
which  a  blindfolded  listener  was  passively  rotated  in  a  chair  and  a  sound 
source  was  also  rotated  about  the  same  axis,  starting  from  a  point  directly  in 
front  of  the  listener.  Depending  on  the  rate  of  rotation  of  the  sound 
relative  to  that  of  the  listener,  several  different  auditory  Illusions  were 
crested*  When  the  sound  rotated  at  the  sane  rate  as  the  listener,  it  was 
perceived  as  emanating  from  above  the  listener's  haad*  When  the  sound  was 
rotated  at  twice  the  rate  of  the  listener,  It  was  perceived  as  coming  from 
behind,  and  when  it  was  rotated  at  l.S  times  the  listener's  rate,  it  appeared 
to  be  at  an  elevation  of  60*  and  behind* 

Logically,  it  seems  necessary  that  the  brain  must  integrate a the  acoustic 
cuee  produced  by  head  motion  with  proprioceptive,  vestibular,  and/or  visual 
cues.  A  sound  may  move  relative  to  the  ears  either  because  the  haad  is 
rotating  and  the  sound  source  is  stationary,  or  because  the  sound  is  moving 
relative  to  the  body  and  the  head  is  stationary  relative  to  the  body.  In 
order  for  a  listener  to  distinguish  between  these  two  cases,  the  changing 
acoustic  cues  must  be  Integrated  with  proprioceptive  feedback  from  the  neck 
muscles,  and/or  vestibular  cues  and/or  visual  feedback. 

Plgure  18  shows  a  model  of  auditory  or  auditory  and  visual  localization 
which  provides  sn  lntsgratlvs  framswork  for  ths  svldance  offerad  in  support  of 
ail  four  of  ths  positions  dlicusssd  sbova.  The  modal  shows  how  visual, 
vestibular,  proprioceptive,  and  auditory  information  might  interact  in  an 
auditory  localisation  task.  Tha  model  is  mads  up  of  three  feedback  control 
loops,  each  of  which  consists  of  an  error  detection  process  and  a  process 
being  controlled.  The  error  detector  combines  feedback  from  ths  controlled 
process  with  other  rafsrence  information  to  generate  sn  error  signal.  The 
error  algnal  guides  the  controlled  process.  Control  loop  C  represents  the 
process  of  visually  fixating  or  tracking  an  object  in  space.  The  error  signal 
from  this  control  loop  serves  as  one  of  several  kinds  of  reference  inputs  to 
control  loop  B.  Loop  B  represents  the  process  of  orienting  the  head  toward 
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Figure  18.  Feedback  control  model  of  auditory  localization. 


(or  Cracking)  a  visual  or  acousCic  event  in  space.  Depending  on  the 
situation,  auditory,  vestibular,  visual,  and  proprioceptive  cues  may  all  play 
a  role  in  determining  tha  error  signal  in  this  control  loop.  Contextual  cues 
end  prior  knowledge  may  also  play  a  role.  For  example,  Welsh  and  Warren 
(1980)  avgwe  that  lntcreensory  bias  occurs  only  when  the  listener  assumes  that 
the  suditory  and  visual  stimuli  are  representative  of  the  seme  object. 

Control  loop  Ain  Figure  18  represente  tha  process  of  pointing  the  arm  to 
localise  a  sound  source.  The  reference  input  for  this  loop  is  the  error 
signal  from  loop  B,  i.e.,  tha  sound  source  direction  relstivs  to  the  body. 

Control  loop  B  requires  at  least  on*  reference  input  snd  proprioceptive 
feedback  from  the  neck  muscles.  The  sols  rsfarsncs  input  must,  of  course,  be 
appropriate  Co  the  modality  of  the  stimulus  being  localised.  Thus  a 
blindfolded,  static  listener  can  localise  e  sound  even  though  vestibular  and 
visual  cues  are  not  available. 

When  more  than  one  rsfarsncs  input  is  available,  it  seems  reasonable  to 
expect  that  the  accurecy  of  the  error  signal  generated  by  the  error  detector 
in  loop  B  may  be  enhanced.  If  e  secondary  reference  input  is  mors  praclsa 
than  the  original,  aa  vision  is  relstivs  to  audition,  then  auditory 
localisation  accuracy  should  ba  snhsncsd,  as  tha  studlss  cltsd  prsvlously  have 
demonstrated. 

The  Jones  and  Kabanoff  (1979)  finding  that  eye  movements  in  the  direction 
opposite  an  auditory  stimulus  degrade  auditory  localisation  may  he  explained 
ae  follows.  Bye  movements  In  the  opposite  direction  should  generate  reference 
Inputs  to  control  loop  B  which  are  irrelevant  to  the  task  at  hand.  If  they 
cannot  be  Ignored,  then  they  might  well  decrease  the  accuracy  of  the  error 
•lgnal  generated  In  that  loop.  In  the  Jones  and  Kabanoff  atudy,  aye  movements 
vers  cued  away  from,  versus  toward,  the  sound  sourca  randomly  from  trial  to 
trial.  Thsrsfors,  the  subject  had  no  way  of  knowing  a  priori  whether  the  eye 
movements  were  consistent  or  inconsistent  with  the  location  of  the  auditory 
lmaga.  Thus,  it  seems  that  Irrelevant  visual  information  can  disrupt  the 
process  of  judging  the  sound  dlraction  ralative  to  the  body,  which  ia 
rapraaantad  by  control  loop  B* 

Tha  feed back-control- loop  model  suggests  that  the  listener  should  be  able 
to  Ignore  irrelevant  (or  even  inconsistent)  visual  information  during  auditory 


localization  if  he  cr  she  knows  that  it  is  irrelevant.  Easton  (1983)  reports 
a  study  in  which  inconsistent  visual  information  was  successfully  Ignored  in 
auditory  localisation.  Subjects  located  a  small  loudspeaker  using  either  the 
visual,  auditory,  or  both  modalities.  On  dual-modality  trials,  subjects 
viewed  the  speaker  through  a  prism  which  displaced  the  visual  image.  On  one- 
half  of  the  trials  the  subject  was  to  point  to  the  seen  position  of  the 
speaker;  on  the  other  trials  he/she  was  to  point  to  the  heard  position.  In 
different  conditions,  the  subject's  head  was  immobilized  or  free  to  move,  and 
the  subject  was  either  unaware  of  the  visual  distortion  or  was  informed.  When 
the  head  was  immobile,  informing  subjects  of  the  visual  distortion  greatly 
reduced  visual  bias  of  the  auditory  localization  response.  When  subjects  wero 
allowed  free  head  movement;  informing  them  of  the  visual  distortion  completely 
eliminated  the  visual  bias  in  localizing  the  heard  position  of  the  speaker. 

The  feedback-control-loop  model  of  auditory  localization  was  suggested  in 
part  by  the  work  of  Matin  and  his  associates  ( cf . ,  Matin,  1982a,  1982b;  Matin, 
Plcoult,  Stevens,  Edwards,  Young,  and  MacArthur,  1982;  Matin,  Stevens,  and 
Plcoult,  1983).  Their  work  shows  that  the  feedback  from  eye  movements  (i.e., 
EEPI)  necessarily  plays  a  role  In  visual  localization,  and  ran  also  play  a 
strong  part  In  auditory  localization.  In  one  experiment,  Matin  et  al.  (1982, 
1983)  partially  paralyzed  observers  by  administering  curare.  The  cursre  had 
the  effect  of  subjectively  requiring  increased  effort  to  move  the  a  but 
did  not  affect  auditory  localization  accuracy.  With  the  head  immobilized,  an 
observer  was  Cold  to  fixate  a  visual  target  at  various  azimuths  (£  10°,  +_  15°) 
in  the  horizontal  plane,  At  eye  level.  The  observer  then  judged  when  a  sound, 
which  could  be  switched  among  any  one  of  25  Loudspeakers  spanning  +_  30° 
azimuth,  was  co-located  with  the  fixated  light.  Paralyzed  observers 
consistently  placed  the  sound  at  a  greater  absolute  azimuth  from  the  median 
plane  than  the  fixated  Light.  Unpaialyzed  observers  were  able  to  match  the 
light  and  sound  positions  accurately.  This  result  suggests  that  EEPI  is 
distorted  in  paralyzed  observers;  specifically,  that  greater  effort  is 
required  to  move  and  hold  the  eyes  at  a  given  deviation  from  the  median  plana 
of  the  haari.  It  .ippears  that  the  observer  interprets  toe  greater  effort  to 
mean  that  his  eyes  are  at  a  greater  angle  from  the  median  plane  than  they 
actually  are.  This  causes  the  observer  to  piace  the  sound  at  that  perceived 
greater  angle,  which  exceeds  the  actual  ang Le  of  the  light.  Matin  et  al. 
conclude  that  EF.Pl  is  used  to  judge  the  position  of  the  eyec  relative  to  the 
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head,  and  also  serves  as  a  spatial  frame  of  reference  for  auditory 
localisation. 

The  feedback-control- loop  model  la  a  uaeful  device  for  Integrating  the 
many  findings  concerning  head  and  eye  movement  effects  on  auditory 
localisation.  It  ahould  be  noted  chat  the  model  la  lntandad  to  account  only 
for  the  affects  of  othar  sanaory  inputs  on  auditory  localisation.  It  may  not 
apply  to  pursly  visual  localisation.  Tha  nodal  may  provide  a  useful  framework 
for  further  research.  Some  lmpllcetlone  of  the  model  for  further  research  are 
discussed  in  Section  2.9. 

2.6  Perception  of  Auditory  Motion  and  Volaae 

The  preceding  section  on  head  and  aya  movements  makes  obvious  tha 
importance  of  evaluating  auditory  localisation  in  dynamic  situation*.  Another 
facet  of  localisation  in  dynamic  situations  involves  locstlng  sources  which 
srs  thsmselves  in  motion.  Auditory  motion  may  bs  defined  relative  to  both 
real  or  illusory  motion,  and  to  the  general  location  of  the  stimulus  (e.g.  , 
intracranial  movamant  or  axtarnal  source  movement). 

The  investigation  of  ixlusory  auditory  motion  has  paralleled  the  study  of 
apparent  movement  in  the  visual  system.  Perrott  (1982)  revlavmd  three  types 
of  illusory  motion:  the  auditory  autoklnetic  effect,  motion  after-effects, 
and  induced  motion.  The  report  of  auditory  movamant,  or  jitter,  of  a 
stationary  source  has  baan  termed  auditory  autokinesia.  Parrott,  Mason  and 
Porbas  (1973),  in  their  review  of  the  auditory  autokinesia,  identified  the 
effects  of  signal  bandwidth  on  the  illusion.  Increases  in  bandwidth  result  in 
decreased  incidence  of  perceived  motion  and  decreased  duration  of  motion  If  it 
does  occur.  Greater  bandwidth  also  increases  the  latency  to  the  first 
perception  of  movement.  Tha  auditory  motion  aftereffect  and  its  visual 
counterpart,  the  waterfall  illusion,  create  the  perception  of  movement  through 
the  termination  of  a  moving  auditory  background.  The  perception  of  movement 
occurs  in  the  opposite  direction  of  tha  background.  Perrott  (1982)  reported 
that  only  minimal  target  displacement  (a  few  degrees)  has  been  observed. 
Induced  motion  creates  the  perception  of  movement  through  the  movement  of 
nearby  sources.  The  effects  of  Induced  motion  effects  are  reportedly  small 
(Perrott,  1982). 
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Perrott  and  Muslcant  (1977,  1981)  studied  listener's  sensitivity  to  real 
motion  of  a  sound  source  In  the  horizontal  plane.  Perrot  and  Muaicant  (1977) 

measured  the  minimum  arc  through  which  a  sound  must  travel  in  order  to  be 

ciscriminable  from  a  stationary  source  at  roughly  the  same  azimuth.  The 

duration  of  a  500  Hz  tone  emitted  from  a  loudspeaker  rotating  about  the 
listener  was  adjusted  until  the  listener  correctly  discriminated  the  moving 
source  from  a  stationary  source  on  751  of  the  trials.  The  minimum  arc 
required  for  discrimination  was  called  the  minimus  audible  movement  angle 

(MAMA).  The  MAMA  is,  of  course,  confounded  with  signal  duration.  The  MAMA 
increased  linearly  with  angular  velocity;  from  8.3"  at  90*/eec  to  21.2"  at 
360 "/sec. 

In  a  second  study,  Perrott  and  Muaicant  measured  a  dynamic  counterpart  of 
the  minimus  audible  angle  (MAA)  developed  by  Mills  (1958).  Observers  adjusted 
the  onset  of  a  500  Hz  tone  emitted  by  a  loudspeaker  rotating  about  them  in  the 
horizontal  plane  until  it  corresponded  with  the  position  of  a  light  (at  0° 
azimuth).  The  standard  deviation  of  the  position  at  which  the  sound  was 
turned  on  (the  dynamic  MAA)  ties  about  1*  for  sound  velocities  of  45,  60,  and 
1 20 "/sec.  The  dynamic  MAA  was  about  2.5°  at  a  source  velocity  of  240*/sec. 
However,  the  mean  position  at  which  the  sound  was  turned  on  was  biased  in  the 
direction  of  rotation  of  the  sound  (about  5*  at  45"/s«c  to  12"  at  240"/sec). 
Perrott  and  Muaicant  note  that  the  dynamic  MAA  corresponds  in  magnitude  to  the 
static  MAA  obtained  by  Mills  and  that  the  MAMA  (Perrott  and  Muaicant,  1977)  is 
much  larger.  The  difference,  they  suggest,  indicates  that  the  MAMA  measures  a 
fundamentally  different  capability  of  the  auditory  system  than  does  the  MAA. 

Perrott  (1982)  examined  observer's  ability  to  judge  the  velocity  of  a 
sound  source  in  a  darkened  chamber.  For  comparison  purposes,  observers  also 
Judged  the  velocity  of  a  silent  source  visually,  with  the  chamber 
illuminated.  Both  the  velocity  and  duration  of  band  limited  (100  -  1000  Hz) 
noise  were  varied.  Observers  were  quite  consistent  in  their  estimates  of 
velocity.  Judgements  of  auditory  velocity  closely  paralleled  those  of  visual 
velocity.  Both  followed  a  simple  power  law  with  a  slope  slightly  less  than 
1.0;  although  for  both  modalities  absolute  velocity  was  overestimated.  In 
addition,  Perrott  found  that  the  duration  of  a  moving  stimulus  has  to  exceed 
100  msec  in  order  for  the  observers  to  detect  motion.  It  was  concluded  that 
the  auditory  system  is  an  capable  of  detecting  velocity  as  is  the  visual 
system. 
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Illusory  auditory  motion  also  occurs  under  some  conditions  in 
lateralization  tasks.  The  intracranial  position  of  an  auditory  stimulus  is 
dependent  upon  the  Interatlmulue  onset  Interval  (ISOI),  That  is,  the  interval 
between  the  onsets  of  dichotically  presented  stimuli  determines  whether  two 
separate  auditory  events  are  heard,  or  whether  a  single  auditory  event 
(phantom  source)  ia  heard.  The  fused  phantom  round  can  althar  be  perceived  as 
stationary  or  as  moving  from  ear  to  ear.  Briggs  and  Perrott  (1972)  varied  tha 
ISOI  In  a  dichotlc  listening  paradigm  and  asked  subjects  to  report  tho 
poeltlon(s)  of  the  stimuli.  Por  100  msec  signal  duration,  ISOIs  less  chan 
15  msec  tended  to  produce  a  single  fixed  source  Image.  For  ISOIs  between  15 
end  75  msecs,  listeners  perceived  e  single  source  which  moved  continuously 
from  the  lead  to  the  lag  ear.  ISOIs  between  75  and  110  msec  produced  a  fused 
image  whose  movement  could  best  be  described  as  broken;  and  at  ISOIs  greater 
Chan  110  msec  tha  stimuli  tiers  heard  as  two  successive  sounds  located  at  each 
ear.  Perrott  and  Basra  (1974)  varied  tha  signal  duration  and  found  that  as 
duration  increased,  the  threshold  for  each  of  tha  above  effects  shifted 
towards  longer  ISOI  values.  That  Is,  the  perception  of  a  fused,  continuously 
moving  image  shifted  in  ISOI  from  a  mean  of  19  msec  for  a  10  msac  signal  to 
62  mate  for  a  300  maac  signal.  These  results  indicate  that  apparant  movement 
la  largely  a  function  of  tha  ISOI  and  signal  duration. 

Closaly  related  to  auditory  movement  In  lateralisatlon  casks  Is  the 
perception  of  the  spatial  extent  of  an  auditory  image.  Spatial  extent  can  be 
defined  aa  tha  amount  of  intracranial  area,  or  volume,  that  an  Image  is 
perceived  to  occupy.  Spatial  extent  la  a  direct  function  of  the  degree  of 
coherence,  or  correlation,  between  two  input  signals  (Blauert,  1982;  1983). 
Fully  coherent  signals  (r  ■  1.0)  ere  perceived  ss  a  fused  auditory  image  with 
limited  epetiel  extent.  As  coherence  decreases,  from  1.0  to  approximately  .4, 
the  area  of  tha  auditory  image  tends  to  Increase.  Signal#  below  .4  coherence 
are  perceived  as  two  separate  events,  each  with  limited  extent.  Blauert 
(1983)  hypothesized  that  the  auditory  system  must  perform  some  sort  of  cross'* 
correlation  analysis,  where  the  correlation  between  the  signals  provides 
Information  about  the  location  <ind  extent  of  the  source. 
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2.7  Effect*  of  Noise  on  Auditory  Localisation 


Ii<  natural  settings  moat  judgements  of  sound  location  are  made  in  a 
background  noise.  Research  has  been  conducted  to  assesa  the  noise 

characteristics  that  interfere  with  or  mask  the  location  of  the  sound 
source,  and  the  changes  in  localisation  accuracy  that  occur  in  the  presence 
of  noise.  Noise  effects,  in  lateralisatlon  and  masking  paradigms,  have 
provided  some  additional  evidence  on  noise  effects  in  localisation.  These 
effects  will  be  reviewed  as  they  pertain  to  localisation  in  the  presence  of 
noise. 

The  studies  on  auditory  localisation  have  asaesaed  the  impact  of  noise 
on  localisation  accuracy,  the  minimal  audible  angle,  and  mask  frequency 
effects.  Jacobsen  (1976,  cited  in  Blauert,  1983)  investigated  the  effects 
of  noise  on  localisation  accuracy,  and  found  that  localisation  was  as 
accurate  in  the  presence  of  noise  (automotive  traffic  at  30  aones)  as  in 

quiet,  if  the  sound  source  was  from  10  to  13  db  above  masking  threshold.  An 

early  study  by  King  and  Laird  (1930)  identified  the  effects  of  noise  on  the 
minimum  audible  angle  for  click  stimuli  in  the  frontal  part  of  the 

horisontal  plane.  The  difference  threshold  increased  from  1.8  degrees  in 
quiet  to  4  degrees  with  45  db  noise  present. 

Moter  (1964,  cited  in  Durlach  and  Colburn,  1978)  varied  the  angular 

separation  between  the  target  and  masker,  and  measured  the  detection 

threshold.  The  detection  threshold  decreased  as  the  angle  between  the 
target  end  masker  increased.  Kock  (1950)  investigated  the  threshold  of 
detectability  for  speech  in  a  white  noise  background  as  a  function  of 

interaural  time  differences.  When  the  signal  and  noise  arrived  at  the  ears 
with  the  same  interaural  time  delay,  the  threshold  for  the  speech  signal  was 
between  8  to  16  db  greater  than  when  the  interaural  delays  differed.  These 
results  were  interpreted  as  an  "unmasking"  of  the  signal  as  the  interaural 
time  difference  between  the  signal  and  the  noise  mask  increased. 

Explanatory  models  of  this  binaural  "unmasking"  phenomenon  have  been 

developed  by  Jeffress  (1972)  and  Durlach  (1972). 
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Canevet,  Germain  end  Scharf  (1980)  investigated  the  effect  of  signal 
masking  aa  both  the  signal  and  mask  frequencies  were  varied.  Critical 
frequency  bands  of  maximal  masking  were  identified  with  regards  to  the 
signal  frequency.  In  general,  as  the  frequency  band  of  the  mask  overlaps 
with  the  signal  frequency,  localisation  accuracy  decreases.  The  effective 
critical  bandwidth  of  a  mask  tended  to  increase  as  the  signal  frequency 
increased,  and  the  critical  bands  were  not  symmetrical  around  the  signal 
center  frequency.  Canevet,  Germain  and  Scharf  (1979)  and  8charf,  Canevet, 
Buus,  and  Marchioni  (1982)  presented  a  mask  which  preceded  signal  onset, 
within  the  bounds  of  the  precedence  effect.  The  masking  stimulus  was  close 
in  frequency  to  the  signal.  They  found  that  the  localisation  threshold  was 
10  to  20  db  greater  than  the  signal  detection  threshold.  As  the  frequency 
difference  between  the  signal  and  mask  increased,  the  detection  threshold 
decreased  more  rapidly  than  did  the  localisation  threshold. 

The  effects  of  noise  on  the  lateralisation  of  a  signal  have  been  more 
extensively  studied  (Gaskell  and  Henning,  1979)  Ito,  Thompson,  and  Colburn, 
1979)  McFadden,  1969}  Robinson  and  Egan,  1974;  Wilbanks,  1983).  In  goneral, 
tha  presence  of  noise  tends  to  reduce  lateralisation  accuracy  when  the 
interaural  time  envelopes  of  the  signal  and  noise  overlap.  These  results 
have  been  interpreted  in  terms  of  phase  cancellation  and  reinforcement. 
When  the  signal  and  noise  bands  overlap  in  frequency,  phase  cancellation  and 
reinforcement  increase;  therefore  the  noise  tends  to  interfere  with 
detection  and  lateralisation  more. 

The  ability  to  localise  sound  in  the  presence  of  background  noise  io 
often  observed  in  daily  life.  The  aptly  named  "cocktail  party  effect," 
(Batteau,  1968;  Durlach  and  Colburn,  1978)  adequately  characterizes  the 
ability  to  localise  and  attend  to  a  source  amongst  a  din  of  background 
noise.  The  evidence  presented  above  has  identified  the  effects  of 
background  noise  on  localisation,  lateralisation,  and  detection  as  a 
function  of  the  angular  separation  of  the  signal  and  noise  sources,  signal 
and  noise  frequency,  and  interaural  time  differences.  However,  further 
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research  is  needed  Co  identify  signal  characteristics  which  lead  to  minimal 
disruption  of  auditory  localisation  in  noiaa. 

2.8  Cockpit  Applications  of  Simulated  Auditory  Localisation 

Tha  foragoing  raview  of  rasaarch  suggests  that  the  auditory  modality 
could  ba  used  to  convey  accurate  spatial  information.  Tha  human  obaarvar'a 
ability  to  localise  auditory  signals  is  completely  ntglected  and  unused  in 
modern  aircraft.  Several  previous  attempts  to  develop  auditory  displays  of 
spatial  information  for  aircraft  never  found  any  application  (c f.,  Forbes, 
1946 1  Mudd,  1965)  Mudd  and  McCormick,  1960).  Kowevarf  in  tha  twenty  years 
since  Mudd's  study,  our  knowledge  of  auditory  local iiation  has  improved 
tremendously.  The  availability  of  this  new  knowledge  presents  an 
opportunity  to  relieve  the  visual  modality,  which  is  currently  over-taxed  in 
cockpit  settings. 

This  section  addresses  three  areas  in  which  spatial  information  conveyed 
by  way  of  audition  could  potentially  be  used  to  improve  aircrew  performance, 
Tha  first  of  these,  directional  cueing  in  head-coupled  control/diaplay 
systems,  is  the  main  focus  of  the  present  project.  A  second  area,  of  which 
the  first  is  a  special  case,  is  the  enhancement  of  the  pilot's  situational 
awareness.  A  third  area  is  tha  enhancement  of  audio  communications  and 
audio  warnings  through  exploitation  of  the  human's  ability  to  attend 
selectively  to  spatially  separated  messages. 

2.8.1  Directional  Cueing 

The  sensitivity  and  range  of  advanced  electromagnetic  (EM)  sensors 
aboard  modern  military  aircraft  far  exceed  the  capabilities  of  the  pilot's 
unaided  senses.  Modern  sensors  such  as  radar,  infrared  (IR)  and  electro- 
optical  (E0)  systems  can  each  provide  a  separate,  amplified  view  of  the 
outside  world  beyond  the  range  of  the  naked  eye.  Unfortunately,  the  pilot 
is  often  over-loaded  with  visual  information  from  a  multitude  of  cockpit 
displays,  and  therefore  cannot  effectively  use  all  the  information  available 
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from  thasa  sensors.  Furthermore,  because  apace  available  on  the  cockpit 
display  front  panel  is  severely  limited,  the  displays  associated  with  each 
sensor  must  be  made  so  small  that  only  a  part  of  the  information  available 
can  ba  affectively  conveyed  to  the  pilot. 

Those  problems  have  been  partially  alleviated  by  integrating  some  of  the 
systems  which  provide  information  and  by  presenting  integrated  information 
on  head-up  displays  (HUD's),  and  multifunction  displays  (MFD's).  The 
effectiveness  of  these  systems  depends  in  large  part  on  how  effectively  the 
information  is  integrated.  At  the  lowest  level,  two  kinds  of  information, 
one  highly  symbolic  and  the  other  concrete,  are  simply  superimposed,  as  when 
flight  control  symbols  are  overlayed  on  the  outside  visual  scene  using  a 
HUD. 

At  a  aore  advanced  level,  a  concrete  picture  of  the  outside  world 
representing  threat  envelopes,  the  desired  flight  path,  etc.,  can  be 
generated  on  an  MFD  and  overlayed  on  the  outside  visual  scene  by  projecting 
tha  picture  onto  a  HUD.  One  of  the  most  significant  aspects  of  this  type  of 
system  is  ite  ability  to  represent  spatial  information  from  various  avionics 
systems  in  the  most  concrete  form  possible,  as  (virtual)  objeccs  in  visual 
epace.  Unfortunately,  the  MFD-HUD  picture  is  confined  to  the  angular  region 
around  the  longitudinal  axis  (boresight)  of  the  aircraft.  This  means,  for 
example,  that  a  reticule  on  the  HUD  can  be  used  to  aim  a  weapon  only  when 
the  target  ie  boresighted  with  the  aircraft. 

The  Head-Coupled  Control/Display  technology  now  under  development  at 
AAKRL/HBA  takes  the  next  logical  step  beyond  the  stationary  HUD. 
Information  from  avionics  sensors  is  represented  as  concrete  objects  in  a 
panoramic  visual  picture.  This  is  accomplished  by  presenting  a  virtual 
image  on  the  pilot's  helmet-mounted  visor.  The  image  projected  on  the  visor 
changes  as  the  pilot  rotates  his  or  he  heed,  just  as  the  real  world  scene 
normally  changes  as  we  turn  our  heads.  This  system,  called  the  h« line t - 
mounted  display  (HMD),  presents  an  omnidirectional  view  of  the  outside  world 
enhanced  by  information  from  avionics  sensors.  The  pilot  can  also  use  his 


or  har  head  position  to  designate  a  target  by  aligning  a  reticula,  which 
remains  stationary  on  the  visor,  over  the  desired  target.  Magnetic  sensors 
on  the  pilot's  helmet  provide  continuous  measurement  of  the  pilot's  head 
orientation,  and  therefore  of  the  pilot's  line  of  sight  when  the  eyes  are 
positioned  straight  and  level.  This  sytsm  is  called  the  hslmst-mountad 
sight  (HMS). 

To  date,  the  Head-Coupled  Control/Display  technology  program  has  focused 
on  the  visual  modality.  Although  vision  is  our  primary  means  of  obtaining 
spatial  information  about  the  world  around  us,  audition  also  plays  a 
significant  role.  In  the  natural  environment,  sounds  are  often  a  cue  as  to 
the  direction  of  important  objects  or  sources  of  information.  Upon  hearing 
a  novel  sound,  a  human  generally  rotatea  his  or  her  head  and  eyes  toward  the 
source  and  acquires  it  visually.  Aural  information  is  a  natural  cue  for 
where  to  look  in  our  visual  environment. 

Simulated  auditory  localisation  (SAL)  could  be  used  to  cue  the  pilot 
where  to  look  in  the  virtual  visual  world  provided  by  the  Head-Coupled 
Control/Display  system.  The  system  currently  conveys  directional 
information  by  presenting  visual  symbols,  such  as  arrows,  on  the  HMD. 
Directional  information  could  instead  be  coded  as  appropriate  aural  signals 
to  the  ears.  The  acoustic  cues  critical  to  auditory  localisation  can  now  be 
simulated  and  presented  dichot ically,  via  earphones,  with  sufficient 
fidelity  to  achieve  localization  performance  comparable  to  that  with  the 
unaided  ear  (Batteau,  1965;  Weinrich,  1982).  Batteau  generated  acoustic 
localisation  cues  by  recording  from  miniature  microphones  placed  in  a 
physical  model  of  the  head  and  pinnae.  Obviously,  an  electronic  simulation 
would  be  necessary  to  make  SAL  practical  for  cockpit  applications. 

Fortunately,  the  knowledge  is  now  available  to  make  at  least  a  "brute- 
fcrce"  electronic  simulation  feasible.  The  brute-force  method  would  involve 
storing  a  large  number  of  transfer  functions.  Each  function  would  deserve 
the  relationship  of  the  frce-field  sound  (if  the  observer's  head  were  not 
present)  to  the  proximal  stimulus  at  the  entrance  to  the  ear  canal  for  each 

73 


apparent  position  of  the  sound  source  to  be  simulated.  Obviously,  a  large 
number  of  positions  would  be  involved,  and  an  algorithm  for  transitioning 
between  positions  would  be  needed  in  order  to  allow  for  head  movement.  The 
brute-force  method  could  be  implemented  by  using  waveform  digitisation 
methods,  like  those  used  in  speech  synthesisers.  Obviously,  a  rule-based 
simulation  would  be  more  elegant.  The  review  of  research  on  auditory 
localisation  presented  in  Sections  2.2  to  2.7  suggests  that  a  rule-based 
system  will  be  possible  in  the  near  future. 

There  are  at  least  three  ways  in  which  SAL  could  facilitate  a  pilot's 
visual  target  acquisition  and  overall  performance.  First,  using  SAL  instead 
of  visual  indicators  should  reduce  the  workload  in  the  often  over-used 
visual  modality.  Second,  the  visual  orienting  response  to  a  localised 
auditory  stimulus  is  natural  and  highly  automatic,  which  should  make  it 
faster  and  more  compatible  with  competing  tasks,  such  at  flight  control 
(cf. ,  Posner,  1978;  Shiffrin  &  Schneider,  1977).  Third,  recent  research  on 
auditory  localisation  indicates  that  auditory  localisation  can  be  highly 
accurate  if  the  listener  is  allowed  to  reorient  his  or  her  head  and  eyes 
toward  the  source  (see  Section  2.5).  It  may  be  possible  to  reorient  the 
head  and  eyes  more  accurately  with  SAL  than  with  symbolic  visual  indicators, 
thereby  reducing  the  number  of  eye  movements  required  to  acquire  a  target. 
These  considerations  suggested  that  SAL  should  be  especially  valuable  in 
facilitating  rapid  acquisition  of  visual  targets. 

2. 8. 1.1  Performance  Implications  of  SAL  Directional  Cues 

Further  research  is  needed  to  investigate  issues  related  to  the 
feasibility  and  potential  benefit  of  using  SAL  as  a  directional  cue  in  the 
cockpit.  As  implied  above,  these  issues  include  the  effectiveness  of  SAL 
relative  to  other  methods  of  providing  directional  information  for  a  visual 
acquisition  task,  and  the  relative  time-sharing  efficiency  of  SAL  with 
competing  flight  tasks.  For  example.  SAL  should  be  compared  to  conventional 
cockpit  visual  displays  (CVD's),  HUD's,  HMD's,  and  synthesized  speech 
displays  (without  acoustic  location  cues).  Both  speech  and  non-spcech  SAL 
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signals  should  b«  compared,  and  all  of  tha  diractional  cuaing  oat hod*  should 

l 

be  evaluated  at  various  levels  of  workload  imposed  by  various  types  of 
concurrent  tasks. 

Wickens,  Sandry  &  Vidulich  (1983)  have  proposed  an  extension  of  the 
stimulus-response  (S-R)  compitibllity  principle  to  include  a  central 
processing  (C)  component.  Tha  8-C-R  principle  postulates  that  certain 
combinations  of  input  modality,  output  modality,  and  type  of  central 
processing  (i.e.,  verbal,  spatial)  result  in  hettsr  task  performance  than  do 
others.  According  to  the  principle,  tasks  which  require  spatial  processing 
and  a  manual  reaponse  should  be  performed  better  when  the  input  modality  is 
visual  than  when  it  is  auditory.  The  principle  predicts  that  a  CVD,  HUD,  or 
HMD  should  produce  faster  visual  acquisition  than  SAL.  However,  orientation 
of  the  eyes  in  response  to  auditory  localisation  cues  is  a  highly  natural, 
perhaps  "automatic"  process  (cf. ,  Posner,  1978).  SAL  cues  may  therefore 
produce  faster  visual  acquisition  than  do  the  CVD's,  HUD's  or  HMD's.  Such  a 
finding  would  represent  an  important  exception  to  the  8-C-R  compatibility 
principle. 

Not  only  should  SAL  produce  better  performance  than  symbolic  visual 
displays  (i.e.,  CVD's,  HUD's,  and  HMD's)  in  single-task  situations,  but  it 
should  also  be  superior  in  multitask  settings.  Two  lines  of  reasoning 
suggest  that  a  SAL-visual  acquisition  task  should  be  time-shared  more 
efficiently  with  a  concurrent  visual-npatial-manual  task  than  are  visual 
acquiaition  tasks  based  on  symbolic  visual  displays. 

First,  the  multiple-resource  theory  of  task  interference  (c.f.,  Navon  & 
Copher,  1979;  Norman  &  Bobrov,  1968;  Wickens,  1980)  suggests  that  the  time¬ 
sharing  efficiency  of  concurrent  tasks  decreases  as  the  amount  cf  overlap 
of  input  modalities,  type  of  central  processing,  and  output  modalities 
increases.  Therefore,  increasing  concurrent  visual  workload  during  a  visual 
acquisition  task  should  affect  visual  acquisition  time  more  when  directional 
information  is  provided  by  a  CVD,  HUD,  or  HMD  than  when  it  is  provided  by 
SAL  cues,  since  SAL  involves  auditory  input. 


Second,  auditory  localization  is  also  probably  more  natural  or  automatic 
than  ia  localisation  based  on  a  symbolic  vieual  display.  This  also  leads 
one  to  expect  that  visual  acquisition  time  should  be  more  affected  by 
competing  visual  workload  in  the  CVD,  HUD,  and  HMD  conditions  than  in  the 
SAL  condition.  When  directional  information  is  provided  by  synthesized 
speech,  increased  workload  may  affect  visual  acquisition  time  to  an 
intermediate  extent,  since  localisation  on  the  basis  of  speech  is  not 
automatic,  but  on  the  other  hand,  the  input  modality  differs  from  that  of 
the  competing  task. 

With  a  concurrent  auditory-verbal-speech  task,  such  as  cockpit 
eosnunications,  multiple-resource  theory  predicts  that  the  CVD,  HUD,  and  HMD 
should  produce  faster  visual  acquisition  than  would  SAL,  since  SAL  and  the 
concurrent  task  share  the  same  input  modality.  The  high  automaticity  of 
auditory  localisation  suggests,  however,  that  it  may  be  little  affected  by 
competing  workload  (cf.,  Posner,  1978;  Shiffrin  &  Schneider,  1977). 
Contrary  to  multiple-resource  theory,  this  leads  to  the  expectation  that 
visual  acquisition  time  with  SAL  directional  cues  may  be  less  affected  by 
competing  workload  than  the  HUD,  HMD,  and  CVD  conditions.  Such  a  result 
would  be  an  exception  to  the  multiple-resource  theory  of  task  interference, 
and  would  therefore  have  implications  for  task  and  human-machine  interface 
design. 


2. 8. 1.2  Other  Issues  Affecting  the  Usefulness  of  SAL  Directional  Cues 

Other  issues  related  to  the  viability  and  effectiveness  of  using  SAL  to 
provide  directional  cues  for  visual  acquisition  in  the  cockpit  include:  (1) 
the  accuracy  of  SAL  in  noise,  (2)  the  rapidity  with  which  listeners  cun 
adapt  to  8AL  cues,  and  (3)  the  effectiveness  in  terms  of  localization  speed 
and  accuracy  of  various  types  of  free-field  SAL  stimuli.  This  last  issue 
was  addressed  in  the  experiments  reported  in  Section  5.0  of  this  report. 
Findings  discussed  previously  in  Section  2,2.3  suggest  that  listener:;  adapt 
to  SAL  cues  within  minutes.  The  findings  reported  in  Section  2.7  indicate 
that  the  Impact  of  noise  on  SAL  accuracy  will  depend  on  the  bandwidth  and 
frequency  composition  of  the  stimulus.  The  larger  the  number  of  frequency 
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components  in  tho  free-field  SAL  stimulus,  or  the  greater  the  bandwidth,  the 
more  resistent  it  should  be  to  masking  in  varying  noise  conditions. 
Research  is  needed  to  quantify  exactly  how  much  typical  aircraft  noise 
degrades  SAL  performance  and  to  identify  the  propertiea  of  SAL  stimuli  which 
best  retain  their  usefulneas  in  noise. 

2.8.2  Enhancement  of  Situational  Awareness 

The  use  of  SAL  to  cue  the  pilot  where  to  look  in  the  visual  environment 
is  a  special  case  of  using  SAL  to  enhance  the  pilot'a  situational  awarenesa. 
In  addition  to  redirecting  the  pilot's  visual  attention  however,  enhancing 
situational  swareneas  alao  implies  keeping  the  pilot  informed  while  making 
fewer  demands  on  visual  attention.  For  example,  auditory  cues  could  provide 
information  about  terrain  clearance  and  obstacles  in  low-level  flight. 
Auditory  cues  could  alert  the  pilot  when  a  dangeroua  condition  develops,  and 
there foro  reduce  the  amount  of  visual  fixation  time  and  visual  proceasing 
the  pilot  spends  monitoring  the  situation.  For  example,  the  distance  to  an 
approaching  obstacle  could  be  cued  by  the  intensity  and  frequency 
composition  of  a  sound.  Change  in  interaural  amplitude  differences  with 
head  movement  might  also  provide  a  useful  cue  for  distance  (see  Section 
2.4).  Different  obstacles  would,  of  course,  be  perceptually  separated  by 
providing  different  acoustic  cues  for  aaimuth  and  elevation.  The  findings 
reviewed  in  Section  2.3  suggest  that  frequency  could  be  used  as  a  cue  to  the 
elevation  of  an  obstacle.  The  relative  speed  of  approach  toward  an  obstacle 
could  be  cued  by  the  rate  of  change  of  intensity  and  rate  of  change  of 
frequency  composition.  Approach  speed  might  also  be  cued  by  a  general  shift 
in  frequency,  simulating  the  Doppler  effect. 

At  n  more  advanced  level,  simulated  and  enhanced  echo-location  cues 
could  be  used  to  help  the  pilot  judge  distances  to  obstacles  in  nap-of-the- 
earth  flight,  supplementing  visual  cues.  It  might  also  be  possible  to  use 
SAL  to  help  maintain  the  pilot'a  spatial  orientation  when  flying  in  low 
visibility  conditions  (i.e.,  as  an  anti-vertigo  cue).  Appropriate  use  of 
auditory  coca  for  functions  such  ao  these  might  greatly  enhance  the  pilot's 
situational  awareness  with  very  little  expenditure  of  mental  processing 


resources. 
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The  major  reason  for  using  audition  to  enhance  situational  awareness 
would  be  to  facilitate  quick  recognition  and  correction  of  dangerous  flight 
conditions  whan  the  pilot's  attention  Is  directed  elsewhere.  The  appropriate 
tact  of  such  cues  is,  thsrefors,  in  high  workload  conditions. 

2.8.3  Enhancement  of  Communications  and  Audio  Warnings 

A  well  known  aspect  of  audition  is  binaural  “unmasking"  or  the  binaural 
masking  lsvel  diffsrtnce  (MLD).  The  sensitivity  of  the  auditory  system  for  a 
signal  in  a  noise  background  Is  much  higher  for  dichotic  listening  conditions 
Chan  in  dlotlc  conditions.  For  example,  presenting  identical  500  Hz  signal 
and  noise  stimuli  to  both  ears,  but  reversing  the  phase  of  the  signal  (but  not 
the  nolsa)  in  one  ear,  lowers  the  signal  threshold  by  11  db  (Jeffress, 
1972).  The  MLD  la  smaller  st  high  frsqusnciaa,  e.g. ,  about  3  db  at  2  kHz  and 
nt  5  kHz.  Durlach  (1964)  proposed  chat  the  basis  for  binaural  unmasking  is 
lntaraursl  phase  or  time  differences  at  low  frequency  (<  1500  Hz)  and 

lntersursl  amplitude  differences  at  high  frequencies. 

The  well  known  “cocktail  party"  affect  is,  of  course,  an  example  of 

binaural  unmasking.  An  audlomatrlcally  normal  listener  can  follow  one  voice 
amidst  a  background  of  other  conversations  and  noise.  However,  if  the  same 
conversations  and  noise  ere  recorded  monaurally  and  played  back,  it  is  very 
difficult  to  follow  any  one  voles.  The  apparent  location  of  the  aource  helps 
the  listener  discriminate  among  the  voices.  It  is  clear  from  lateralization 
research  that  binaural  differences  contribute  to  this  discrimination  ability 
(cf.,  Mills,  1972).  It  Is  spparsntly  not  known  whether,  or  to  what  extent, 
monaural  localization  cues  also  contribute  to  the  ability  to  discriminate  a 
specially  distinct  signal  from  noise. 

Aircraft  communications  and  audio  warnings  are  generally  binaurally 

unmasked  from  ambient  noise  by  reversing  ths  signal  phase  at  one  earphone. 

However,  this  does  not  separata  the  signal  from  noise  or  other  concurrent 

signals  (such  as  communications  jamming)  coming  through  the  same  audio 
system.  The  detectability  and  intelligibility  of  communications  and  audio 
warnings  might  be  greatly  enhanced  if  each  signal  were  given  a  different 
apparent  direction.  This  would  be  relatively  easy  to  accomplish  for  threat 
warnings,  since  threat  warning  systems  generally  measure  angle-of-arr ival 
Information.  Aircraft  audio  warnings  and  messages  coming  over  UHF  as  opposed 
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VHF  radio  could  also  be  given  different  apparent  directions.  Research  is 
needed  to  determine  the  benefit  of  associating  cockpit  signals  with  different 
apparent  directions  in  conditions  of  noise  and  jamming. 

2.9  Racomaendatlona  for  Further  Research 

In  addition  to  the  appllcations-oriented  research  Issues  discussed  in  the 
last  section,  there  are  many  baaic  research  issues  which  should  be  addressed 
in  order  to  facilitate  the  application  of  SAL  in  the  cockpit.  A  few  of  these 
bseic  Issues  which  setm  most  Important  are  discuesed  In  this  section.  An 
obvious  nssd  Is  for  a  specification  of  the  necessary  end  sufficient  cues  for 
localisation  in  both  the  horiiontal  ai.d  vertical  planes.  Although  the 
binaural  cuaa  (ITD,  IAD)  can  ba  speclfiad  in  any  given  situation,  much  lass  la 
known  about  monaural  cues.  Isolation  of  tha  psychologically  naceasary  and 
sufficient  acoustic  features  for  localisation  would  graatly  simplify  the 
electronic  generation  of  SAL  cute. 

A  first  task  ia  to  account  for  tha  discrepant  raaulta  bet wean  llstanara 
and  taat  situations  in  tha  study  of  monaural  cuaa*  Tha  results  ravlawed  in 
Sections  2.2.3  and  2.3.2  show  that  narrow-band  noise  stimuli  in  tha  4  to  12 
kHz  frequancy  range  produce  tha  illusion  that  tha  aourca  la  locatad  in  tha 
median  aagittal  plana  in  soma  cases,  and  in  the  horizontal  plane  in  other 
conditions.  Moreover,  some  llstanara  map  frequencies  from  4  to  8  kHz  and 
again  from  8  to  12  kHz  into  the  front  quadrant  of  the  horizontal  or  median 
plane,  while  for  other  listeners  the  spatial  referents  of  frequencies  from  4- 
12  kHz  cover  the  entire  hemisphere  from  front  to  back.  These  results  suggest 
that:  (1)  contexc  and  prior  knowledge  play  an  important  role  In  the 

Interpretation  of  monaural  cues,  and/or  (2)  the  narrow-band  noise  stimuli  are 
not  fully  adequate  cues  for  source  direction  (l.e. ,  such  stimuli  do  not 
Include  all  the  acoustic  features  produced  by  interaction  of  the  free-fleld 
sound  with  the  pinnae  and  torso).  The  experiments  on  monaural  auditory 
Illusions  ahould  be  redone  using  noise  stimuli  with  features  that  mimic  the 
head-related  transfer  functions  In  the  4  to  12  kHz  region.  The  features 
should  be  systematical  ly  tested  to  determine  which  are  necessary  and 
sufficient  for  monaural  localization.  Further  transformation  of  the  fvee- 
field  sound  spectrum  by  the  pinnae  should  be  eliminated  by  using  headphone 
presentation.  This  research  should  also  carefully  control  and/or  manipulate 
the  listener's  cpectatlons  with  regard  to  possible  source  locations. 


A  second  related  research  question  is  whether  monaural  cues  consist  of 
narrow  frequency  bands  which  are  amplified  relative  to  the  other  parts  of  the 
spectrum,  or  ’comb-filter”  patterns  like  those  produced  by  a  multlple-delay- 
and-add  system  (see  the  discussion  at  the  end  of  Section  2.3.2).  Watkins 
(1978)  produced  the  illusion  of  s  moving  sound  source  in  the  median  plane  by 
varying  the  delay  of  one  component  of  a  two-delay-and-add  signal.  Varying  the 
daisy  produces  a  stimulus  with  a  changing  comb-filter  spectrum.  Stimuli  with 
a  fixed  combfilter  pattern  spectrin  have  apparently  not  been  tested  to 
determine  whether  they  produce  auditory  illusion*.  Further  research  is  needed 
to  determine  whether  combfilter  spectral  patterns  produced  by  a  multlple- 
delay-and-add-aystera  with  fixed  delays  produce  the  illusion  of  a  source 
located  at  a  flxsd  location  in  Cha  median  plane.  Similar  experiments  should 
also  ba  dona  in  the  horisontal  plane.  In  the  horizontal  plana,  the  illusion 
produced  by  combfiltar  spectral  patterns  in  combination  with  appropriate 
ITD'e  and  lAD's  should  ba  investigated.  Watkins  (1978)  pointed  out  that  the 
combfilter  patterns  produced  by  a  two-delay-and-add  system  with  an 
appropriate  range  of  delay  valuaa  (based  on  Batteau's,  1967,  analysis  of  pinna 
reflections)  conaiat  of  a  pattern  of  several  peaks  and  notches  in  the  audible 
frequency  range  (eee  Figure  17).  In  some  cases  a  single  amplitude  peak  (e.g. 
12  kHz)  Is  associated  with  two  apparent  source  directions.  Thus,  If  one 
simulated  only  a  peek  at  12  kHs,  one  might  expect  ambiguous  localization,  as 
the  studies  of  auditory  illusions  with  monaural  stimuli  Indeed  found. 

A  third  area  needing  research  is  the  Investigation  of  acoustic  cues  which 
enable  the  listener  to  judge  Che  distance  of  a  source.  There  tre  at  least  two 
ways  In  which  listeners  might  derive  absolute  Information  aa  to  source 
distance  through  repeated  nampllng  during  head  movements.  Lambert  (1974)  has 
shown  that  listeners  could  judge  source  distance  from  the  rate  of  change  of 
the  IAD  ee  thn  head  la  rotated  in  the  horizontal  plane. 

A  second  possible  mechanism  which  listeners  could  use  to  judge  distance 
1?  to  compare  the  magnitude  of  tha  ITD  to  monaural  pinna  cues  for  source 
elevation  and  azimuth.  For  example,  suppose  a  point  source  la  located  In  the 
horizontal  plane,  at  an  azimuth  of  90*  (opposite  the  left  ear).  If  the  source 
were  at  infinity,  the  ITD  would  be: 

3*  r 

r—  —  -  1.2  msec  , 

2  c 
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where  r  is  Che  radius  of  Che  head  and  c  la  Che  speed  of  sound.  As  Che  source 
moves  closer,  monaural  pinna  cues  remain  relatively  unchanged,  but  Che  ITD 
decreases.  In  Che  extreme  case,  where  Che  source  Is  located  on  tha  surface  of 
Che  head,  Che  ITD  la: 

i  -  »  800.  Usee  . 
c 

The  race  of  the  change  In  ITD  with  source  distance  Is  largest  when  Che  source 
is  ac  90*  azimuth  and  when  Che  source  :s  close  to  Che  head.  It  la  noc  clear 
whether  near-fleld  acoustic  effects  would  impair  the  usefulness  of  this  cue, 
or  whether  the  monaural  cues  are  sufficiently  accurate  to  allow  the  comparison 
required.  It  should  be  noted  chat  this  cue  does  not  require  head  movement, 
but  ic  does  require  a  lateral  source  location  and  a  broad-band  stimulus.  This 
and  the  head  movement-related  cue  suggested  by  Lambert  should  be  investigated 
experimentally. 

The  effects  of  head  movement  and  vision  on  auditory  localisation  is  a 
complex  area  in  which  there  are  many  Issues  that  warrant  further  research. 
Two  general  Issues  will  be  raised  here:  (!)  the  nature  of  the  cues  which  are 
modulated  during  head  movement,  and  (2)  the  affect  of  other  sensory  Inputs  and 
proprioception  on  auditory  localization. 

Section  2.5  discusses  three  hypotheses  which  are  variations  on  the  cue- 
modulation  position.  The  first  of  these  hypotheses  is  Lambert's  (1974) 
mathematical  model,  which  shows  that  listeners  could  make  absolute  judgements 
of  source  azimuth  in  the  horizontal  plane  based  on  the  rate  of  change  of  ITD 
with  the  angle  of  head  rotation,  0.  Obviously,  the  modulation  of  other  cues 
(e.g.,  IAD,  pinnae  cues)  might  also  play  a  role  in  the  facilltative  effect  of 
head  movement.  It  would  be  relatively  easy  to  test  the  cue  modulation 
hypothesis  by  Independently  manipulating:  (l)  the  characteristics  of  the 
stimulus,  so  as  to  control  the  availability  of  each  cue  (see  Section  4.0),  and 
(2)  the  presence  or  absence  of  cue  modulations,  by  either  restraining  the 
listener's  head  or  allowing  free  head  movement. 

Two  additional  hypotheses  related  to  the  cue  modulation  position  could 
also  be  easily  tested  in  the  same  experiment.  Freedman  and  Fisher  (1968) 
suggested  that  pinnae  cues  help  direct  the  Initial  orienting  response  to  a 
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sound,  but  do  not  serve  as  important  cues  during  head  movement.  As  noted  in 
Section  2.5,  if  their  hypothesis  is  correct,  then  the  time  required  to 
localize  a  sound  source  (i.e.,  response  time  in  a  speeded  task)  should  be 
smaller  when  pinnae  cues  are  available  during  head  movement  than  when  they  are 
not.  Pollack  and  Rosa  (1967)  propose  that  the  facultative  effect  of  head 
movement  is  due  to  the  fact  that  the  listener  can  take  advantage  of  the  high 
auditory  acuity  in  the  0*  azimuth  region.  This  hypothesis  could  be  tested  by 
manipulating  the  amount  and  dlractlon  of  head  mcveiMnt  allowed  during 
localization. 

The  feedback  control  model  preeented  in  Section  2.5  suggests  three 
different  areas  of  research  related  to  the  effects  of  head  movement  and  vision 
on  auditory  localisation.  These  erase  include  (1)  research  to  evaluate  the 
usefulness  of  the  feedback  control  approach  as  an  explanatory  model,  (2) 
research  to  evaluate  the  transfer  functions  relating  inputs  (reference 
signals)  in  the  model  to  outputs,  and  (3)  research  which  attempts  to  quantify 
individual  differences  in  localisation  performance  based  on  model  parameters. 

If  the  feedback  control  modeling  approach  la  useful,  then  it  should  be 
possible  to  deduce  end  experimentally  taet  previously  untested  hypotheses.  An 
implication  of  the  model,  shown  in  Pigure  18,  is  the  sxistence  of  a  purely 
auditory  illusion  analogous  to  the  oculomotor  *  visual  illusion  reported  by 
Matin,  Plcoult,  Stevane,  Edwards,  Young,  and  MacArthur  (1982)  and  Matin, 
Stavans,  and  Plcoult  (1983).  Moving  tha  heed  to  localize  a  sound  is  somewhat 
analogous  to  moving  the  eyes  in  visual  localization.  A  critical  experiment 
would  Involve  having  an  obssrver,  partially  paralyzed  by  the  drug  curare, 
rotate  the  head  until  he  or  she  perceived  a  sound,  located  laterally  from  the 
median  plane  of  the  body,  to  be  directly  In  front  of  the  head.  The  observer 
would  then  be  asked  to  locate  a  second  sound  In  the  median  plane  of  the 
body.  If  the  observer  perceives  that  the  head  Is  turned  farther  chan  lc 
really  la,  due  to  the  Increased  effort  required  because  of  the  curare,  then 
the  second  sound  should  be  positioned  so  as  to  deviate  from  the  median  plane 
In  the  direction  opposite  the  head  orientation.  Such  a  result  would  replicate 
Matin  et  el.'e  (1982,  1983)  results  with  visual  stimuli  and  provide  evidence 
that  proprioceptive  cues  associated  with  head  movement  also  (like  eve  movement 
Information)  provide  a  spatial  frame-of-reference  for  auditory  localization. 
This  experiment  is  essentially  a  test  of  the  postulated  control  loop  B  of  the 
feedback  control  model  (see  Figure  18). 
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If  th«  general  structure  of  the  feedback  control  model  can  be  verified , 
then  it  can  be  used  as  a  tool  for  quantifying  the  effect  of  sensory  inputs  and 
proprioception  on  auditory  localisation.  A  control  theory  approach  to 
lntersensory  effects  in  auditory  localisation  would  force  investigators  to 
exemlne  relationships  aaong  sensory  Inputs  and  motor  outputs  which  night 
otherwise  be  ignored.  The  approach  would  also  result  in  the  identification  of 
closed-loop  and  lntersensory  transfer  functions  which  would  be  useful  in 
predicting  auditory  localisation  perforaance. 

Studies  of  individual  perfornances  in  the  control-theory  franework  would 
make  possible  a  more  precise  quantification  of  the  basic  skills  underlying 
exceptional  performance*  Such  Information  could  be  very  useful  in  selection 
and  training. 

Two  additional  research  areas  which  are  Important  to  SAL  applications  are 
auditory  motion  perception  and  the  effects  of  noise  on  auditory  localisation 
performance.  Further  research  is  needed  to  determine  the  cues  responsible  for 
the  perception  of  motion  for  auditory  eources  which  are  actually  moving.  8uch 
information  is,  of  course,  crucial  to  simulating  auditory  notion*  A  salient 
question  related  to  noise  effects  on  localisation  is  the  extent  to  which 
typical  aircraft  background  noise  and  jamming  degrade  the  effectiveness  of 
auditory  directional  cues.  It  would  be  desirable  to  identify  the 
characteristics  of  auditory  cues  whose  effectiveness  is  minimally  degraded  by 
noise  and  jamming. 


3.0  FACILITY  DESCRIPTION 


3.1  Facility  Overview 

Th*  purpose  of  Che  experimental  facility  la  to  serve  as  a  teat  bed  fov 

I 

evaluating  Che  effectiveness  of  simulated  auditory  localisation  (SAL)  as  a 
method  of  providing  directional  information  in  head-coupled  control/display 
systems*  The  basic  task  presented  to  the  human  test  subject  is  to  indicate 
the  apparent  location  of  a  sound  source  based  on  SAL  cues  provided  to  the 
subject  by  way  of  headphones*  The  SAL  cues  change  in  real  time  as  the 
subject's  heed  moves,  just  as  the  proximal  acoustic  stimulus,  at  the  entrance 
to  the  ear  canal,  changes  in  normal  localisation.  The  subject's  head  position 
is  measured  and  used  in  real  tins  to  synthesize  the  SAL  cues,  which  are  fed 
back  to  the  subject  via  headphones. 

A  general  schematic  of  the  apparatus  for  monitoring  head  position  and  for 
simulating  localization  cues  is  shown  in  Figure  19.  Rotational  and 

translational  movement  of  the  subject's  head  la  measured  at  60  Hz  and  used  to 
switch  an  audio  signal  among  36  loudspeakers  surrounding  a  model  of  the  human 
head  and  ears  in  another  room.  Broad-band,  high-fidelity  microphones  are 
placed  at  the  entrances  to  the  ear  canals  in  the  model  head.  The  sound  at  the 
modal  head  is  amplified  and  fed  back  to  the  subject  via  headphones  to  produce 
SAL  cues. 

The  room  containing  the  model  head  and  ears  is  covered  with  sound- 
absorbent  material.  The  36  loudspeakers  are  spaced  at  10s  Intervals.  The 
model  head  remains  stationary,  and  the  audio  signal  is  transitioned  between 
loudspeakers  so  as  to  simulate  smooth,  continuous  movement  of  the  signal 
relative  to  the  head. 

The  following  subsections  provide  more  detailed  descriptions  of  the  SAL 
facility  hardware,  software,  and  experimental  space. 

3.2  Hardware  Design 

The  functional  diagram  of  the  hardware  for  the  SAL  system  is  depicted  in 
Figure  20.  An  IBM  personal  computer  (PC)  controls  a  runber  of  peripheral 
devices  attached  through  two  parallel  Interfaces.  The  peripheral  hardware 
required  to  Implement  the  SAL  system  can  be  regarded  as  three  major 
subsystems.  The  first  of  these  Is  an  Audio  production  and  control  system, 
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Figure  19.  Schematic  of  SAL  experimental  facility. 
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Figure  20.  Functional  diagram  of  the  SAL  system. 


made  up  of  an  audio  aource,  an  audio  controller  with  power  amplifier,  and  a 
loudspeaker  array.  A  second  Is  the  playback  system,  Including  the  binaural 
microphones  in  the  manikin's  head,  a  two-stage  amplification  system,  and  a 
headset.  The  third  subsystem  is  the  3-Space  Tracker  system,  which  provides 
head  and  hand  position  measurements* 

The  playback  system,  dupicted  in  Figure  21,  provides  the  audio  feedback 
path  from  the  speaker  array  to  the  subject.  Two  omnidirectional  microphones 
are  mounted  in  the  ears  of  tho  manikin,  located  in  the  center  of  the  speaker 
array.  The  audio  signal  is  praampllflad  by  two  30  dB  amplifiers  located  at 
the  base  of  the  oanlkin.  The  signal  then  pastes  into  the  control  room  where 
the  right  and  left  audio  signals  are  fed  into  two  Independent  volume  controls 
before  final  amplification.  The  outputs  of  the  amplifiers  are  connected  to 
the  headset  worn  by  the  subject.  The  headphones  are  Sennheiser  model  HD230's, 
which  are  mounted  in  a  Centex  HGO-26/P  flight  helmet  with  the  visor  removed. 

The  heart  of  the  audio  production  and  control  system  is  the  audio 
controller  and  power  amplifier.  The  controller/amplifier,  shown  in  Figure  22, 
controls  the  location  of  the  sound  source  in  the  arrsy  of  loudspeakers.  Up  to 
three  audio  signals  can  be  mixed  together  to  produce  an  audio  output  Kith  a 
gain  of  0  dB.  The  mixed  output  is  then  connected  to  three  digitally- 
controlled  step  attenuators.  After  attenuation,  the  three  audio  signal*  are 
amplified  and  connected  tc  the  array  of  36  loudspeakers  by  the  relay  control 
matrix. 

The  controller/anpllfl.er  id  deslgnad  to  that  only  three  relays  may  be 
closed  at  any  one  time.  Each  of  tha  three  relays  in  turn  Is  connected  to  a 
single  amplifier.  The  level  of  attenuation  applied  to  each  empll fier-relay 
pair  is  controlled  Independently  through  the  computer  interface.  An  external 
subject  ready  light  la  also  controlled  through  ths  computer  Interface. 

The  apparent  position  of  the  sound  source  relative  to  the  manikin  le 
controlled  by  activating  combinations  of  loudspeakers  sc  various  attenuation 
values.  Sines  the  loudspeakers  are  located  at  10°  Intervals  around  the 
manikin,  sound  sources  at  such  positions  (called  "reel"  sources)  are  genurated 
by  simply  activating  the  appropriate  loudspeaker  at  mlnimm  attenuation 
(maximum  intensity)  level.  Sound  sources  located  between  10*  intervals 
(called  '*phantomH  sources)  are  simulated  by  simultaneously  activating  s  pair 
of  adjacent  loudspeakers.  For  example,  s  source  at  5  degrees  azimuth  is 
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Figure  22.  Audio  controller  and  power  amplifier. 


simulated  by  activating  the  loudspeakers  at  both  0  degrees  and  10  degrees 
azimuth  at  equal  intensities  (attenuation  values).  In  order  that  phantom 
sources  not  sound  louder  than  real  sources,  the  two  loudspeakers  which  create 
the  perception  of  e  phantom  source  between  them  are  each  played  et  reduced 
power*  The  relative  power  delivered  to  the  adjacent  loudspeakers  determines 
the  perceived  position  of  the  phantom  sound  source.  The  development  of  the 
algorithm  for  controlling  apparent  source  position  was  undertaken  as  a  part  of 
the  validation  of  the  SAL  facility,  and  la  described  In  Section  4.4. 

Head-position  measurement  Is  accomplished  by  using  the  3-Space-Tracker 
system,  model  3ST002,  manufactured  by  Polhemus  Navigation  Sciences,  Inc.  The 
system  utilizes  a  low  frequency,  magnetic  field  to  determine  the  position  and 
orientation  of  a  sensor  in  relation  to  a  source.  The  system  has  a  maximum 
resolving  power  of  0.1  angular  degrea.  However,  its  accuracy  may  be  degraded 
when  the  source  or  sensor  la  In  close  proximity  (6  feet)  to  large  metallic 
objects.  The  system  can  use  one  or  two  sources  and  up  to  four  aensors.  Bach 
source-sensor  pair  is  uniquely  identified  in  the  system  as  a  "station.”  The 
system  can  measure  the  relative  position  and  orientation  of  up  to  four 
stations  at  the  aame  time. 

The  3-Space-Tracker  system  cen  be  interfaced  to  the  host  system  through  a 
parallel  or  serial  port;  however,  only  the  parallel  Interface  insures  the 
integrity  of  transmitted  data  through  a  hand-shaking  protocol.  A  customized 
record  of  information  representing  position  and  orientation  for  all  active 
stations  can  be  sent  to  a  host  computer  or  other  data  collecton  system 
continuously  or  at  selected  Intervals. 

In  the  present  research,  one  source  end  two  sensors  are  configured  as  two 
stations.  The  source  is  secured  to  e  nonmetalllc  support  attached  to  the 
celling  in  the  center  of  the  testing  room.  Metallic  objects  have  been  cleared 
from  a  6  foot  equare  area  around  source  and  sensor  to  reducs  interference  (see 
Section  3.4).  One  seneor  le  mounted  on  the  helmet  worn  by  the  test  subject. 
A  second  sensor  is  attached  to  a  response  indicator  assembly  held  by  the 
subject  throughout  experimental  testing. 
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3.3  Software  Architecture 
3.3.1  Major  Components 


Figure  23  shows  the  digital  hardware  and  software  conflgurationa  for  the 
SAL  system  in  block-diagram  form.  The  principal  hardware  components,  are  the 
IBM  PC  with  associated  special  purpose  cards  (monochrome  adapter,  memory 
expansion,  expanded  Input/output  capability),  the  audio  controller/ amplifier 
hardware,  and  the  3-Space-Tracker  system.  The  principel  software  components 
depicted  in  Figure  23  are  the  executive  control  routine,  the  assembly  language 
Interface,  and  the  speaker  look-up  table.  The  executive  control  routine, 

assembly  language  interface,  and  speaker  look-up  table  are  all  resident  in 
memory  during  program  execution.  The  double  and  single  headed  arrows  depict 
the  Information  transfer  between  software  and  hardware  components  in  the 

system  end  show  the  number  of  channels  used  in  each  function. 

The  executive  control  routine,  written  in  Pascal,  handles  four  mejor 

functions.  The  functions  include  Interaction  with  the  user,  program 

initialization,  experimental  control  and  data  collection,  and  calculation  of 
the  new  sound  position  relative  to  the  manikin  (see  Section  3.3.3). 

The  assembly  language  Interface  handles  time-critical  low-level 
communication  and  control  operations,  under  the  direction  of  the  executive 
control  routine.  Handling  these  functions  in  assembly  language  helps  to 

maximize  the  speed  of  these  real-time  control  activities. 

The  speaker  look-up  table  identifies  the  relays  which  must  be  closed  and 
the  values  of  each  of  the  three  attenuators  in  order  to  produce  a  sound  source 
at  a  given  azimuth  relative  to  the  manikin.  The  new  sound  azimuth  is  the 
address  in  the  table  at  which  the  relay  and  attenuator  device  codes  and  values 
are  stored.  The  relay  and  attenuator  device  codes  and  values  are  output  to 
the  audio  controller/amplifier  by  the  assembly  language  Interface. 

3.3.2  Overview  of  Software  from  User's  Perspective 

Control  software  for  the  SAL  project  was  designed  to  be  flexible  enough 
to  run  all  the  anticipated  experiments.  The  software  handles  the 
initialization  of  all  devices  and  warns  the  experimenter  if  data  storage  space 
is  low.  The  experimenter  can  set  values  for  all  experimental  parameters  and 
store  them  in  the  Master  Test  file.  Table  3  shows  the  input  parameters  of  the 
Master  Test  file. 
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Figure  23.  Block  diagraa  of  digital  control  logic  and  devices. 


TABU  3 


Input  Parameters  in  the  Master  Teat  File 

Number  of  trials 
Collect  data  (yes  or  no) 

Auto  or  manual  running  mode 
Nose-on  time  criterion  (.1  sec) 
Nose-on  angle  criterion  (degrees) 
Delay  to  start  trial  (sec) 

Maximum  sound  duration  (sec) 
Intertrial  interval  (sec) 

Monitor  ^Finger  positions, 

Head  position  or  Both 
Initial  Azimuth  file  name 
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The  Initial  Azimuth  file  contains  the  initial  azimuth  of  the  sound  on 
each  test  trial,  assuming  that  the  subject  Is  facing  the  0  degree  azimuth 
position.  The  initial  azimuth  is  also  the  apparent  azimuth  of  the  sound 
throughout  the  test  trial.  The  control  software  checks  for  the  existence  of 
this  file  and  prompts  the  experimenter  to  create  it  if  none  is  found  by  that 
name. 

The  starting  block  number  and  starting  trial  number  can  also  be  set  at 
the  beginning  of  a  test  session.  This  feature  proves  valuable  when  an 
interruption  of  testing  requires  restarting  testing  at  some  trial  other  than 
trial  number  one  or  some  block  other  than  block  one.  The  experimenter  can 
also  choose  to  balance  the  left  and  right  headphone  channels  at  the  beginning 
of  the  testing  block,  which  is  necessary  when  the  audio  stimulus  source  is 
changed. 

At  the  beginning  of  each  test  block,  the  experimenter  to  prompted  to 
enter  the  deta  filename,  subject  identification,  experimental  condition  and 
time  (date  of  testing).  Data  filenames  ars  chacked  before  the  experiment 
starts  to  ensure  no  duplicate  names  sre  used.  The  control  software  warns  the 
experimenter  that  the  existing  data  file  will  be  overwritten  unless  a  new 
filename  Is  specified.  The  Master  Test  file  parameters  collected  on  each 
block  are  written  ae  the  first  line  of  each  data  file  to  serve  as  a  permanent 
reference  to  identify  the  file  later. 

In  addition  co  managing  the  presentation  of  azimuth  locations  for  each 
trial,  the  control  software  also  reports  any  errors  detected  in  the  data  from 
the  3-Space  tracking  system  and  allows  a  trial  to  be  aborted  and  restarted  If 
difficulties  occur  during  the  course  of  a  trial.  At  the  end  of  a  testing 
block  all  files  are  closed.  After  each  test  block,  the  experimenter  is 
prompted  to  enter  another  Master  Test  file  if  an  additional  block  of  trlAls  Is 
to  be  run. 

The  events  transpiring  on  each  test  trial  from  the  experime iter' 8  and 
subject's  perspectives  are  as  follows.  First,  the  "ready"  light  at  0  degrees 
azimuth  is  illuminated,  Indicating  that  the  SAL  system  is  ready.  The  subject 
then  presses  the  consent  switch  on  the  response  indicator  to  indicate  that 
he/she  is  ready.  The  response  indicator  is  a  hand-held,  pistol-shaped 
object.  A  short  delay  then  ensues  (delay  to  start  trial)  before  the  system 
begins  to  monitor  the  subject's  head  position.  The  subject  Is  required  to  be 
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facing  0  degrees  azimuth  (“nose-on")  for  a  minimum  amount  of  time  with  a 
minlmisn  accuracy  before  the  trial  continues*  Once  the  nose**on  criteria  is 
met,  the  timer  is  started  and  the  sound  is  turned  on*  The  sound  remains  on 
until  the  consent  switch  is  pressed  a  second  time*  At  that  point  three  events 
happen  in  close  succession:  (1)  the  elapsed  time  is  recorded,  (2)  the 

position  and  orientation  of  the  listener's  head  and/or  the  position  of  the 
hand-held  response  indicator  is  recorded,  and  (3)  the  sound  is  turned  off. 
The  data  and  identifying  Information  for  the  trial  are  then  written  to  disk. 
If  there  are  further  test  trials  to  be  run  in  the  test  block,  the  inter trial 

Interval  then  ensues  before  the  ready  light  Is  illuminated  for  the  next 

trial.  After  the  last  trial  in  each  block,  the  ready  light  flashes  three 

times  to  signal  the  subject  that  the  block  has  been  completed. 

During  testing,  the  system  displays  certain  information  on  the  PC  display 
in  order  to  allow  the  experimenter  to  monitor  the  subject's  performance  and 
detect  any  system  problems.  The  data  is  displayed  in  near  real  time  and 
Includes  the  trial  number,  the  apparent  azimuth  of  the  sound,  the  indicated 
azimuth  of  Che  sound,  and  the  response  time. 

3.3.3  Real  Time  Calculation  of  Sound  Position 

The  azimuth  of  the  sound  relative  to  the  manikin  was  adjusted  in  real 
time  to  compensate  for  the  listener's  head  movements.  The  objective  of  these 
adjustments  was  to  make  the  sound  appear  to  the  listener  to  emanate  from  a 
stationary  source  at  a  specified  azimuth  for  each  test  trial,  called  AZQ.  The 
listener's  head  movements  were  measured  by  the  3-Space  Tracker  system.  As 
discussed  earlier,  the  listener  was  seated  in  the  “Subject”  room,  and 

localized  sounds  were  preaentad  via  headphones.  The  manikin  was  located  in 
the  adjacent  "Model”  room,  at  the  center  of  the  circular  array  of  36 
loudspeakers.  The  following  calculations  were  made  at  a  rate  of  60  Hz,  the 
sampling  rate  of  the  3-Space  Tracker  system. 

Three  numbers  regarding  the  listener's  head  orientation  were  input  to  the 
PC  at  60  Hz: 

XH*  "  the  coordinates  of  the  center  of  the  listener's  head 

relative  to  the  center  of  the  loudspeaker  array,  measured  as  shown  in 
Figure  24  (in  inches). 


QUADRANT  2 


AZjj  ■  the  azimuth  of  the  center  of  the  listener's  heed  relative  to 
the  center  of  the  array,  measured  as  shown  In  Figure  24  (in  degrtvsi- 

On  any  given  test  trial,  there  were  two  constants  which  entered  into  r h« 
calculations: 

r  -  the  radius  of  the  array  of  loudspeakers  (114  Inches). 

kZQ  •  the  azimuth  from  which  the  sound  should  appear  to  come  for  the 
listener  (l.e.,  the  “initial  azimuth"  discussed  previously). 

If  at  the  start  of  the  test  trial,  the  center  of  the  listener's  head  was  at 
the  point  (x  ■  0,  y  »  0)  and  the  head  was  oriented  in  the  horizontal  plane 
such  that  AZjj  ■  0,  then  the  sound  was  presented  to  the  manikin  at  azimuth  AZQ. 

Suppose  now  that  the  listener's  head  is  moved  to  a  point  (xj|,  yH)  and 
rotated  In  azimuth  an  amount  AZ^.  In  order  to  make  the  sound  appear  to  thit 
listener  to  still  come  from  the  same  point  in  space  (xf,  yg),  the  aound  must 
be  moved  to  azimuth  AZC  relative  to  the  manikin,  as  shown  in  Figure  24. 

It  Is  clear  from  Figure  24  that: 

AZC  ■  C  -  AZh  when  C  _>  AZH 

and  AZC  -  C  -  AZ^  +  360*  when  C  <  AZ^. 

The  angle  C  Is  calculated  aa  follows: 

C  -  Arctan  ((y8  -  yH)/(xg  **  *h)1  +  k 


Where 


xH  -  r  cos  (AZ0) 
and  y8  -  r  sin  (AZQ). 
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The  constant  k  -  180®  when  the  quantity  (xfl  -  x^)  <  0;  k  ■  360°  when  (xg 
-  Xjj)  2.0  and  (yg  -  yH)  <  0;  and  k  -  0  otherwise. 

3.3*4  Calculation  of  Perceived  Sound  Direction 

Aa  indicated  in  the  Master  Test  file  (Table  3),  the  SAL  system  can  be 

configured  to  measure  the  subject's  finger  position  only,  head  position  only, 

or  both.  The  so-called  "finger"  poeition  is  actually  the  position  of  the 
response  indicator  which  the  subject  holds  in  the  hand. 

Depending  on  the  instructions  given  the  subject,  either  head  position  or 
finger  poeition,  or  both  can  be  used  as  indicators  of  the  perceived 

(Indicated)  sound  direction.  When  the  "monitor  finger  poeition"  (F)  option  is 
selected,  the  indicated  sound  direction  written  to  the  data  file  is  based  on 
the  position  of  the  response  indicator,  as  discussed  below.  When  the  "monitor 
head"  position  (H)  option  is  selected,  the  indicated  sound  direction  is  based 
on  the  head  azimuth  and  position.  When  the  "monitor  both"  (B)  option  is 

activated,  two  indicated  sound  directions  are  written  to  the  data  record,  one 
based  on  head  azimuth  and  position  and  the  other  based  on  the  position  of  the 
response  indicator.  In  all  caaas,  the  indicated  sound  direction  i«t  based  on 
head/response  indicator  position  just  after  the  consent  switch  is  closed. 

3.3.4. 1  Calculation  of  Perceived  Sound  Direction  Based  on  Finger  Position 

The  3-Space  Tracker  provided  measurements  of  the  position  of  the  response 
indicator  relative  to  the  3-Space  Tracker  source,  which  was  positioned 
directly  above  the  listener's  head  at  the  stArt  of  each  block  of  test 

trials.  The  listener  wee  instructed  to  press  the  consent  switch  when  the 

response  indicator  wns  pointed  directly  toward  the  sound.  The  perceived 

direction  of  the  eound  was  taken  as  the  azimuth  of  the  response  Indicator 

relative  to  the  3-Space  Tracker  source. 

The  perceived  direction  of  the  sound,  AZr,  can  be  computed  from  the 
Cartesian  coordinates  (xr,  yr)  of  the  response  indicator  in  the  horizontal 
plane,  relative  to  the  3-Space  Tracker  source,  as  follows: 

AZf  ■  Arctan  [vr/xf]  +  k 
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•  #* 


Where  k  ■  180°  when  x_  <  0;  k  ■  360°  when  x_  >  0  and  y_  <  0;  and  k  ■  0 

r  r  —  r 

otherwise. 


3. 3. 4. 2  Calculation  of  Perceived  Sound  Direction  Baaed  on  Head  Orientation 
and  Position 

As  mentioned  above,  the  software  provides  an  option  for  recording  final 
head  position  (just  after  the  consent  switch  is  pressed)  as  an  indicator  of 
the  perceived  direction  of  the  sound  source.  Of  course,  using  final  head 
position  as  an  indication  of  perceived  sound  direction  is  only  appropriate 
when  the  subject  was  instructed  to  use  his  or  her  head  orientation  to  indicate 
sound  direction. 

If  the  subject's  head  Is  rotated  so  that  he  or  she  is  looking  exactly  in 
the  direction  of  the  sound  (that  is,  head  azimuth,  AZ^,  equals  the  initial 
azimuth  of  the  sound,  AZ,)  then  the  direction  of  the  sound  relative  to  the 

V 

manikin,  AZC,  will  be  0  degrees  azimuth.  Any  rotation  or  translation  of  the 
subjects  head  from  that  point,  such  chat  he  or  she  is  no  longer  directly 
facing  the  sound,  produces  a  change  in  AZC,  as  described  in  Section  3.3.3. 

Assuming  that  the  subject  Is  instructed  to  use  head  orientation  as  an 
indication  of  sound  direction,  the  deviation  represented  by  AZ_  can  be 
regarded  as  the  error  in  the  subject's  perception  of  the  sound  direction.  The 
actual  (correct)  azimuth  of  the  sound  Is,  of  course,  AZQ.  Then  the  perceived 
sound  location  indicated  by  head  position,  AZ^,  can  be  calculated  as  follows: 

AZp  -  AZ0  -  AZC. 

3.3.5  Response  Time  Measurement 

The  timing  resolution  of  the  IBM  PC  Internal  time-of-day  clock, 
configured  from  the  factory,  is  100  msec.  However,  a  computer  clock  is  Just  is 
counter,  driven  at  a  certain  frequency.  Therefore,  more  precise  timing  is 
possible  by  altering  the  frequency  of  the  timing  counter  to  produce  near- 
millisecond  accuracy  timing.  In  the  SAL  software,  the  counter  frequency  is 
set  to  1000  Hz,  which  produces  millisecond  resolution.  A  restriction  in  the 
use  of  the  timing  routine  is  that  disk  drives  cannot  be  accessed  during  timing 
periods  because  the  system  ie  being  Interrupted  more  frequently  than  with  the 
100  msec  resolution  clock  and  this  interferes  witn  disk  access.  The  original 
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counter  frequency  Is  restored  when  the  timing  period  has  ended,  in  order  to 
allow  disk  access. 


3.4  Experimental  Rooms 

Figure  25  depicts  the  SAL  experimental  room  layout.  The  manikin,  fitted 
with  microphones,  was  located  in  the  model  room,  surrounded  by  the  circular 
array  of  36  speakers.  The  manikin  was  the  KEKAR  model  DB4004  manufactured  by 
Industrial  Research  Products,  Inc.  The  manikin  was  fitted  with  molded  rubber 
pinnae,  model  DB065  (right  ear)  and  model  DB066  (left  ear).  The  microphones 
were  two  model  BT-1759'e,  provided  by  Knowles  Electronics,  Inc.  The 
microphones  were  positioned  at  the  manikin's  ear  canal  entrances.  The 
interior  of  the  manikin  head  was  stuffed  with  loose  fiberglass  insulation. 

The  loudspeakers  were  located  2.89  m  from  the  center  of  the  manikin  head 
at  10  degree  intervals.  Certain  tests  with  human  subjects  were  conducted  in 
the  model  room  rather  than  the  subject  room;  hence  similar  provisions  were 
made  in  both  rooms.  Referring  again  to  Figure  25,  note  that  a  portion  of  the 
celling  has  been  raised  in  both  the  model  and  subject  testing  room  to  clear 
any  electromagnetic  influence*  from  the  area  in  which  the  3-Space  Tracker 
system  was  used  to  monitor  head  and/or  hand  position.  This  precaution  insured 
that  the  highest  possible  accuracy  was  obtained  from  the  3-Space  Tracker 
system.  Figure  25  also  shows  placement  of  a  uniform,  opaque  curtain  drawn  in 
a  circle  around  the  testing  area.  The  curtain  serves  to  eliminate  any  visual 
cues  which  might  biso  the  subject' e  judgement  of  sound  source  location. 

Acoustical  treatment  was  applied  to  surfaces  in  the  model  room  to  reduce 
ambient  noise  levels  and  reverberation.  The  treatment  included  24  sound- 
absorbent  panels,  each  4  ft.  high  by  4  ft.  wide,  placed  just  outside  the 
periphery  of  the  loudspeaker  array.  The  panels  were  four-inch  thick  Sonex 
foam.  In  addition,  the  raised  portion  of  the  celling  in  the  model  room  was 
covered  with  fiberglass  insulation,  loosely  draped  to  form  a  scalloped 
surface.  The  acoustical  performance  of  the  room  is  discussed  In  Section  4.1. 
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Figure  25.  SAL  experimental  room  layout. 


4.0  VALIDATION  OF  TBS  EXPERIMENTAL  FACILITY 


In  this  section  is  described  a  9eries  of  tests  which  were  conducted  in 
order  to  evaluate  the  physical  and  psychological  fidelity  of  the  SAL 
facility.  This  section  also  covers  refinements  made  to  the  SAL  system,  based 
on  the  results  of  these  tests.  The  description  of  these  procedures  assumes  a 
knowledge  of  the  basic  architecture  of  the  SAL  facility,  which  is  presented  in 
Section  3.0* 

The  tests  and  refinements  dealt  with  five  different  topic  areas  which  can 
affect  the  fidelity  of  the  SAL  system:  (1)  the  acoustical  character  of  the 
sound  treated  room  in  which  SAL  cues  were  generated,  (2)  the  adequacy  of  the 
audio  production  system,  including  the  power  amplifiers  and  the  loudspeaker 
array,  (3)  the  frequency  response  of  the  audio  playback  system,  from  the 
manikin  to  the  headphones  worn  by  the  listener,  (4)  the  speed  of  the  real-time 
hardware  and  software  for  controlling  the  direction  of  the  sound  relative  to 
the  manikin,  and  (5)  formal  experimental  tests  designed  to  evaluate  the 
psychological  fidelity  of  the  SAL  system. 

4,1  Room  Acoustics  Tests 

Brief  acoustical  tests  were  conducted  to  quantify  the  ambient  noise  level 
and  reverberation  time  in  the  room  where  SAL  cues  were  generated.  This  room 
contained  the  acoustic  manikin  and  the  array  of  36  loudspeakers.  The 

acoustical  treatment  of  the  room  is  described  In  Section  3.4. 

Since  it  was  anticipated  that  most  of  the  experimental  tasting  would  take 
place  on  weekday  afternoons,  the  test  of  ambient  noise  level  was  conducted  on 
such  an  occasion.  This  provided  some  assurance  that  noise  sources  over  which 
the  experimenters  had  no  control,  such  as  traffic  noise  from  nearby  roads, 
would  be  at  a  level  representative  of  that  which  would  occur  during 

experimental  testing. 

The  ambient  noise  level  was  msasursd  at  the  center  of  the  room  at  the 
height  of  the  manikin's  ears  (62  inches).  The  microphone  was  a  Bruel  and 

KJaer  type  4165  condenser  microphone.  The  noise  level  was  measured  by  a  Bruel 

and  Kjaer  type  2203  precision  sound  pressure  level  meter. 

The  major  noise  sources  within  the  building  which  could  contribute  to  the 
noise  level  in  the  room  included  a  central  heating/cooling  unit,  a  central 
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blower,  and  a  peripheral  fan  in  the  ventilation  return  duct  Immediately 
outside  the  room.  The  ambient  noise  test  was  repeated  four  times:  once  with 
all  three  sources  on,  with  the  fan  off  and  the  other  sources  on,  with  the  fan 
and  the  blowers  off  and  the  heating/cooling  unit  on,  and  with  all  three 
sources  off. 

The  measured  ambient  noise  level  with  all  three  noise  sources  in  the 
building  on  was  36  dBA.  With  only  the  peripheral  fan  off,  the  ambient  noise 
was  33.5  dBA.  In  the  remaining  two  conditions  (central  blower  and  peripheral 
fan  off;  and  all  three  sources  off)  the  measured  noise  level  was  32  dBA. 

The  experimental  testing  was  conducted  with  the  peripheral  fan  off  at  all 
times.  The  heating/cooling  unit  and  central  blower  operated  Intermittently 
during  testing,  under  the  control  of  an  automatic  thermostat.  However,  their 
operation  made  only  a  small  difference  in  the  ambient  noise  level  (33.5  versus 
32  dBA). 

The  reverberation  decay  time  was  measured  for  a  wide-band  (10  kHz)  noise 
source.  The  source  was  a  Bruel  and  Kjaar  type  4205  sound  power  source.  The 
noise  source  was  placed  midway  between  the  center  and  one  wall  of  the  room.  A 
microphone  (same  as  that  used  in  the  ambient  noise  test)  was  again  located  in 
the  center  of  the  room  at  the  height  of  the  manikin's  ears.  The  microphone 
signal  was  boosted  by  a  Bruel  and  Kjaer  type  2807  microphone  preamplifier  and 
fed  to  a  Tectronlx  model  5223  digitizing  oscilloscope.  The  noise  source  was 
operated  at  92  dB  re  1.0  pW  for  approximately  10  seconds  in  order  to  allow  the 
noise  to  fill  the  room.  The  oscilloscope  was  triggered  such  that  it  displayed 
the  steady-state  noise  signal  just  prior  to  its  termination,  and  several 
seconds  of  the  decaying  noise  after  termination. 

The  display  vertical  dimension  was  in  voltage  units.  The  elapsed  time 
for  a  10  to  1  drop  in  voltage  was  recorded.  Since  voltage  is  proportional  to 
sound  pressure,  p,  and  the  Sound  Pressure  Level  (SPL)  is  defined  as  20  log 
(p/pref),  a  10  to  1  drop  in  voltage  is  equivalent  to  a  20  dB  drop  in  SPL. 
Thus  3  times  the  elapsed  time  required  for  a  10  to  l  drop  in  voltage  is  an 
estimate  of  the  reverberation  time,  defined  as  the  time  required  for  a  60  dB 
drop  in  SPL.  Hie  reverberation  time,  so  estimated,  was  approximately  350  msec 
for  the  broadband  noise  source. 
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4.2  Post-Installation  Tests  of  the  Audio  Production  and  Control  System 


The  audio  production  and  control  system  and  its  components  underwent  a 
series  of  four  different  tests  and/or  adjustments  after  the  system  was 
installed  in  the  sound-treated  room  (see  the  preceding  section  of  this 
report).  The  audio  production  and  control  system  includes  the  following 
components:  the  audio  source,  the  audio  controller  and  amplification  unit, 

and  the  36  loudspeaker  assemblies. 

4.2.1  Loudspeaker  Polarity  Check 

A  simple  polarity  check  was  conducted  to  ensure  that  all  the  loudspeakers 
were  radiating  in  phase.  A  1.5  volt  battery  was  connected  to  the  loudspeaker 
terminals,  with  the  positive  terminal  of  the  battery  to  the  loudspeaker 
terminal  marked  positive.  The  direction  of  the  movement  of  the  woofer  was 
noted.  The  loudspeaker  terminals  were  reversed  where  necessary  so  that  the 
woofer  moved  outward  when  the  battery  was  applied  as  described. 

4.2.2  Amplitude  Response  as  e  Function  of  Frequency 

The  purpose  of  this  test  was  to  determine  the  amplitude  response  over  the 
audible  frequency  range  of  each  of  the  36  loudspeakers  in  the  sound-treated 
room.  This  test  served  to  identify  any  major  differences  between  the 
loudspeaker  units  (including  the  crossover  circuits)  and  any  sources  of 
unusual  revarberation  in  the  room. 

A  white  noise  source  was  played  through  a  single  loudspeaker  and  the 
amplitude  response  as  a  function  of  frequency  from  0  Hz  through  25  kHz  was 
measured.  The  equipment  included  the  microphone  and  preamplifier  described  in 
the  last  section,  and  a  real-time  digital  spectrum  analyzer  (Hewlett-Packard 
model  3582A).  The  microphone  was  located  at  the  center  of  the  room.  The  test 
was  repeated  for  each  of  the  36  loudspeakers. 

A  typical  amplitude  response  is  shown  in  Figure  26.  The  upper  trace  Is 
the  white  noise  input  signal  and  the  lower  trace  Is  the  output  of  a 
loudspeaker  Installed  In  the  sound-treated  room.  The  vertical  scale  for  the 
upper  trace  has  been  shifted  downward  so  that  the  two  traces  can  he  displayed 
simultaneously.  The  overall  difference  in  the  amplitudes  of  the  two  traces  Is 
about  55  dBV.  Notice  that  the  loudspeaker  response  Is  virtually  flat  (within 
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6.  InDut  signal  (upper  trace)  and  amplitude  response 
r  trace)  of  the  audio  production  and  control  system. 
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2  dBV)  our  to  approximately  23  kHz.  All  36  of  the  loudspeakers  produced  this 
same  pattern,  except  that  the  locations  of  the  small  (less  than  1  dBV)  ripples 
varied  from  one  loudspeaker  to  the  next.  The  locations  of  the  ripples  in  the 
white  noise  input  signal  also  varied  from  one  test  to  the  next.  This  fact 
suggests  that  most  of  the  variation  In  the  loudspeaker  responses  Is  due  to 
sampling  and  quantification  error  in  the  spectrin  analyzer  and/or  variations 
In  white  noise  source  over  time. 

4.2.3  Equalization  of  the  Loudspeakers 

The  purpose  of  this  procedure  was  to  remove  any  differences  in  the 
intensities  of  the  loudspeakers  for  a  constant  signal  applied.  The  SPL  of 
each  loudspeaker  was  measured  with  the  audio  control  unit  set  at  minimum 
attenuation  value.  The  input  signal  was  white  noise  at  100  dBA,  measured  from 
the  mixer  test  point.  The  digital  control  unit  and  the  mixer  are  described  in 
Section  3.0. 

Since  the  SPL  of  the  loudspeakers  differed  slightly  at  minimum 
attenuation,  the  loudspeakers  were  equated  to  a  constant  SPL  by  increasing  the 
attenuation  values  on  the  more  Intense  units.  /is  a  consequence,  a  unique 
attenuator  value  was  defined  for  each  loudspeaker  as  its  new  minimum 
attenuation  value.  These  values  were  used  to  adjust  the  speaker  lookup  table 
which  controls  the  movement  of  the  sound  around  the  manikin,  as  described  in 
Section  4.4. 

4.2.4  Perceptual  Equivalence  of  Phantom  and  Real  Sources 

The  purpose  of  this  test  was  to  determine  whether  listeners  could 
discriminate  between  sounds  produced  by  a  single  loudspeaker  (called  a  "real" 
source)  and  sounds  produced  by  two  loudspeakers  activated  simultaneously  with 
the  same  total  power  (called  a  "phantom"  source),  In  the  SAL  facility, 
loudspeakers  are  located  at  10  degrees  Intervals  in  the  horizontal  plane 
surrounding  the  manikin  or  listener.  Therefore,  real  sources  can  be  played 
only  at  increments  of  10  degrees.  In  order  to  produce  a  sound  which  appears 
to  come  from  an  intermediate  azimuth,  two  loudspeakers  must  be  activated 
simultaneously.  The  relative  power  emitted  by  the  two  loudspeakers  determines 
the  perceived  location  of  the  sound. 


A  white  noise  signal  was  played  thro  lgh  "phantom'*  and  real  sources 
successively.  An  observer  was  seated  at  tho  center  of  the  loudspeaker  array 
with  his/her  ears  at  the  same  height,  as  the  center  of  the  loudspeaker 
assemblies.  The  observer  was  asked  to  report  whether  sounds,  played  for  3  sec- 
each,  came' from  real  or  phantom  sources.  The  room  was  darkened  in  order  to 
eliminate  visual  cues.  Two  experienced  observers  were  each  run  for  about  30 
trials  each.  Both  observers  performed  at  near  chance  level,  i.e.,  they  could 
not  discriminate  phantom  from  real  sources  under  these  conditions. 

A. 3  Tests  of  the  Audio  Playback  System 

Two  types  of  tests  were  performed  on  the  audio  playback  system.  First, 
the  SPL  output  of  the  left  and  right  headphones  was  measured  to  determine 
whether  the  two  units  were  of  equal  efficiency,  and  to  determine  whether  their 
responses  were  proportional  to  the  Input  intensity.  A  second  test  examined 
the  amplitude  response  over  the  audible  frequencies  of  the  entire  playback 
system,  including  the  model  room,  the  manikin,  the  microphones  in  the 
manikin* 8  ears,  the  two-stage  stereo  amplification  system,  and  the 
headphones.  Since  the  manikin  has  pinnae,  this  chain  of  components  should 
produce  amplitude  response  functions  over  frequency  like  those  produced  by  the 
human  head  and  pinnae  (l.e.,  a  head-related  transfer  function  or  HRTF) . 
Furthermore,  the  amplitude  response  functions  should  vary  as  a  function  of  the 
direction  of  the  source  relative  to  the  manikin  In  the  same  manner  that  human 
HRTFs  vary  as  a  function  of  source  direction. 

In  the  first  test,  the  measurements  were  taken  with  the  sound  pressure 
level  meter  and  microphone  described  in  Section  A.l.  The  microphone  was 
located  In  the  center  of,  and  approximately  one  inch  from,  the  radiating 
surface  of  the  headphone  being  tested.  The  headphones  were  removed  from  the 
helmet,  and  microphone  and  headphone  were  surrounded  by  fiberglass  insulation 
material  In  order  to  reduce  ambient  noise  Inputs  to  the  microphone  as  much  as 
possible.  The  Input  signal  was  inserted  after  the  second  headphone  amplifier , 
which  Is  the  same  point  from  which  the  left  and  right  headphone  channels  are 
equalized  during  experimental  testing.  The  test  was  conducted  with  three 
different  types  of  Input  signals  and  at  three  different  signal  intensities  for 
each  headphone. 
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Th«  test  conditions  and  results  are  shown  In  Table  4.  It  Is  clear  from 
the  results  that  the  left  and  right  channels  are  of  equal  efficiency,  and  that 

their  responses  are  proportional  to  the  input  Level.  The  response  of  the 

headphones  is  nearly  identical  for  the  white  noise  and  the  2  kHz  high-pass 
noise  algnale.  As  would  be  expected,  the  reeponse  is  somewhat  reduced  for  the 
1  kHs  low-pass  noise  signal. 

In  the  second  test,  the  microphone  was  positioned  in  front  of  the  left  or 
right  headphone,  end  isolated  from  ambient  noise  as  described  for  the  first 
test.  Tha  microphone  was  boosted  by  a  Bruel  and  Kjaar  type  2807  preamplifier, 
and  analysed  with  e  Hewlett-Packard  modal  3S82A  raal-tima  digital  spectrum 
analyser.  Tha  manikin  was  positioned  at  tha  canter  of  the  model  room,  and  one 
of  three  sources  in  the  array  of  36  loudspeakers  was  activated.  The  source 
activated  wee  in  the  horlaontal  plane  at  0,  90,  or  270  degrees  aslmuth 

relative  to  the  sagittal  plane  of  the  manikin*  Tha  input  signal  to  the 

loudspeaker  was  white  noise.  Tha  output  level  at  tha  center  of  the  room  was 
54  dBA.  Tha  signal  levels  of  tha  left  end  right  headphone  channel!,  at  the 
output  of  tha  second  headphone  amplifier,  were  set  to  105  dBA.  The  amplitude 
response  was  measured  in  dBV  relative  to  the  input  signal  at  intervals  of  500 
Hs,  from  500  Hs  to  20  kHs. 

The  results  of  the  second  test  are  shown  in  Figure  27.  Three  amplitude 
response  functions  ere  plotted  for  each  channel  of  the  playback  system.  The 
three  functions  ere  for  e  eound  source  at  0  dagraea  aslmuth,  for  a  source  at 
90  degrees  aslmuth  on  tha  asms  aids  as  the  setive  microphone  (ipsilateral) ,  or 
for  a  aourca  at  90  degrees  aslmuth  on  the  oppoelce  side  (contralateral).  Up 
to  12  kHz,  the  functions  are  similar  to  the  corresponding  averaged  HRTFs  for 
human  listeners  shown  in  Figure  2.  Since  the  present  data  were  collected  at 
intervals  of  500  Hs,  some  of  the  fine  structure  typical  of  HRTFa  does  not 
appear.  No  comparisons  can  ba  made  above  12  kHz  since  the  averaged  human 
HRTFs  atop  there,  and  little  or  no  data  on  HRTFs  have  been  collected  above 
chat  frequency.  The  results  suggest  that  the  playback  aystam  accurately 
reproduces  the  transformation  of  the  free-field  sound  by  the  head,  torso,  and 
pinnae. 
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TABLE  4 


Sound  Praaaura  Laval  Output  (In  dBA) 
at  tha  La ft  and  Right  Haadphonaa 
aa  a  Function  of  Input  Laval  and  Typa  of  Signal 


Input  Typa  of  inoue  aianal 

Haadphona  laval  Whit  a  Z  kkt  nigh-  1  Itkz  low- 


channal 

■ICI2SH 

paaa  nolea 

paaa  noiaa 

90 

34.3 

33.0 

34.0 

laft 

103 

68.3 

68.3 

63.0 

114 

78.3 

78.0 

74.0 

90 

54.3 

35.0 

34.0 

right 

105 

68.3 

68.3 

63,0 

114 

78.3 

78.0 

74.0 
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Figure  27.  Amolitude  response  of  Che  audio  playback  system. 
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4.4  Real-Time  Control  of  Sound  Direction 


4.4.1  Interface  with  the  3-Space  Tracker  System 

The  sampling  rate  of  the  3-Space  tracking  system  is  the  rate  limiting 
factor  in  correcting  for  the  direction  of  sound  when  the  subject  moves  his  or 
her  head.  When  only  head  position  is  measured  (one  station  active)  the 
sampling  rate  of  the  3-Space  tracking  system  is  60  Hs.  Informal  tests,  where 
rapid  head  movements  were  made,  demonstrated  that  the  location  of  the  sound 
remained  stationary  with  a  60  Ht  sampling  rate.  However,  when  finger  and  head 
position  were  measured  at  the  same  time  (two  stations  active,  where  the 
sampling  rate  of  the  3-Space  tracking  system  was  30  Ht).  correction  of  sound 
position  lagged  behind  the  rapid  movement  of  the  head.  In  effect,  the  sound 
did  not  remain  stationary  with  both  stations  active.  To  eliminate  this  slow 
response  in  correcting  sound  location,  a  strategy  of  enabling  only  one  station 
at  a  time  was  adopted. 

At  the  beginning  of  each  teat  trial  the  head  station  was  enabled,  so  that 
the  location  of  the  sound  could  be  corrected  for  head  movements.  The  test 
subjects  were  instructed  to  identify  the  apparent  direction  of  sound  by 
pointing  the  response  indicator  in  that  direction  and  depressing  the  consent 
switch.  When  the  switch  closure  was  sensed  by  the  PC,  the  finger  station  was 
enabled  and  the  head  station  was  disabled.  After  allowing  settling  time  (100 
msec)  for  the  change  in  stations,  the  finger  station  was  sampled  twice  and  the 
second  sample  was  kept.  The  finger  station  was  then  disabled  and  the  head 
station  was  enabled  to  prepare  for  the  next  trial. 

4.4.2  Determination  of  Relay  and  Attanuator  Values 

In  order  to  keep  the  location  of  the  sound  stationary  as  the  head  moves, 
each  report  of  head  position  by  the  3-Space  Tracker  is  followed  by 
calculations  in  the  PC  executive  routine  to  compensate  for  head  translation, 
aa  described  in  Section  3.3.3.  These  calculations  produce  the  new  azimuthal 
location  of  the  sound  relative  to  the  manikin,  which  Is  needed  in  order  to 
make  the  sound  appear  stationary  to  the  listener  wearing  headphones.  The  new 
azimuth  value  is  ULed  as  an  address  to  a  table  of  3600  sets  of  relay  numbers 
and  attenuator  values,  called  the  "speaker  look-up  table".  Each  entry  in  the 
table  represents  the  relays  and  the  attenuator  values  which  produce  a  sound  at 
a  given  one-tenth  degree  position  in  the  loudspeaker  array. 


If  the  sampling  rate  of  the  3-Spac.e  tracking  system  were  fast  enough  to 
keep  up  with  one-tenth  degree  changes  in  head  position,  then  the  corrected 
values  obtained  from  the  3-Spaee  tracker  could  be  used  directly  to  move  the 
sound  around  the  manikin.  Unfortunately,  the  maximum  sampling  rate  of  the  3- 
Space  Tracker  la  too  alow  to  keep  up  with  one-tenth  degree  changes  in  head 
position  during  rapid  head  movements.  During  rapid  head  movements,  the 
successive  aslmuths  reported  by  the  3-Space  Tracker  can  differ  by  several 
degress.  Whan  the  sound  was  moved  around  the  manikin  In  increments  of  several 
degrees,  the  sound  appeared  to  the  listener  to  hsvt  an  intermittent,  "choppy" 
quality. 

It  was  necessary,  therefore,  to  develop  e  routine  which  plays  the  sound 
at  intermediate  aeimuth  values  when  two  successive  corrected  values  from  the 
3-Space  Tracker  differ  by  more  then  a  threshold  value.  For  example,  if  two 
successive  values  reported  by  the  tracker  differ  by  2  degrees,  the  sound  could 
be  transitioned  between  the  two  positions  in  increments  of,  say,  .3  degrees. 
Informal  taste  were  conducted  with  experienced  listeners  to  determine  an 
Increment  else  which  produced  the  perception  of  a  stationery,  yet  smooth, 
sound  during  the  fastest  heed  movements  expected  in  the  SAL  system.  It  was 
found  that  moving  the  sound  in  .4  degree  Increments  produced  a  perceptually 
smooth  sound  which  appeared  to  remain  • tat  ionary  for  reasonably  rapid  head 
movemente. 

The  algorithm  for  transitioning  the  sound  between  adjacent  loudspeakers 
as  it  moved  around  the  manikin  wee  also  modified  to  further  Improve  the 
smoothness  of  the  eound  during  heed  movements.  The  original  algorithm  used  a 
linear  mixing  of  adjacent  speaker  voltage  levela.  Thus,  to  produce  a  sourco 
which  appeared  to  come  from  midway  between  two  loudepeakere,  the  adjacent 
loudapeakmre  were  each  set  at  30  percent  of  their  minimum  attenuation 
values.  This  linear-voltage  algorithm  was  bated  on  the  results  of  phaaor 
enelyeia  of  stereophonic  listening  ( c f . ,  Bauer,  1961).  However,  the  phasor 
analysis  applies  to  only  low  frequency  (<1300  He)  sounds,  since  the  auditory 
syetem  uses  phaae  information  only  in  that  region.  For  broadband  sounds,  It 
ie  possible  that  a  more  appropriate  algorithm  is  a  constant  RMS  voltage  (l.e. 
constant  total  power)  relationship.  In  this  algorithm,  the  voltage  applied  to 
adjacent  loudspeakers  is  proportional  to  the  sine  or  cosine  of  the  angle  of 
the  apparent  source  location  relative  to  the  reel  loudspeaker  locations.  When 


Chis  algorithm  was  applied,  the  sound  heard  through  the  headphones  appeared 
much  smoother  during  head  movement  then  it  did  with  the  linear-voltage  rule* 
The  sine-cosine  algorithm  was  therefore  incorporated  Into  the  speaker  look-up 
table  and  governed  the  movement  of  sound  around  the  manikin. 

4*5  Psychological  Fidelity  Experiment 

t 

i 

4.5.1  Introduction 

An  experiment  was  conducted  in  order  to  evaluate  the  psychological 
fidelity  of  the  SAL  apparatus,  i.a.,  to  quantify  differences  between 

localization  in  the  SAL  environment  as  opposed  to  normal  free-fleld 

localisation.  Complete  paychologlcal  fidelity  would  mean  that  the  SAL 

apparatus  is  perceptually  equivalent  to  normal  localisation,  though  not 

nacesaarily  physically  equivalent*  The  SAL  apparatus  simulates  the  physical 
conditions  in  normal  localisation  by  uaing  a  technique  made  up  of  three 
principal  componente* 

First,  sound  sources  at  azimuths  between  whole-number  multiples  of  10 
degrees  are  simulated  by  activating  two  adjacent  loudspeakers 
simultaneously.  Such  sources  are  called  "phantoms",  as  opposed  to  "real" 
sources,  which  represent  a  single  loudspeaker  and  were  located  at  whola-nunbar 
multiples  of  10  degress.  For  any  achievable  phyalcal  set-up,  there  will 

always  be  soma  physical  difference  between  the  sound  produced  by  two 
loudspeakers  radiating  simultaneously  (i.a.,  a  phantom)  and  that  produced  by  a 
single  loudspeaker.  Fortunately,  physical  identity  is  not  required;  It  is 
only  necessary  that  phantom  and  real  sources  sound  the  sama  to  the  human 
listener. 

Second,  in  the  SAL  facility  the  acoustic  cues  for  localization  era 
produced  by  radiating  the  KEHAR  manikin,  transducing  the  sounds  at  the 
entrance  to  the  manikin's  ear  canals  into  electrical  signals,  and  amplifying 
and  reproducing  the  sounds  through  headphones.  It  is  of  interest  to  determine 
to  what  extent  the  manikin/playback  system  is  perceptually  equivalent  to  the 
listener's  own  ears  during  normal,  fraa-fleld  localization. 

Third,  In  order  to  make  a  sound  appear  to  be  stationary  for  the  liataner 
wearing  headphones  in  the  SAL  facility,  the  direction  of  the  sound  source 
relative  to  the  manikin  must  be  changed  in  real  time  to  compensate  for  the 
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listener's  head  movement.  For  example,  as  the  lister, er  rotates  his  or  her 
head  to  the  right,  the  sound  impinging  on  the  manikin  must  be  moved  an  equal 
amount  to  the  left.  Of  necessity,  the  mannur  in  which  the  sound  is  moved 
around  the  manikin  only  approximates  the  manner  in  which  a  stationary  sound 
moves  relative  to  the  head  in  normal  fraa-fleld  listening  conditions.  Tt  is 
of  Interest  to  determine  whether  the  method  used  to  compensate  for  the 
listener's  head  movement  in  the  SAL  facility  (see  section  A. A. 2)  is  the 

perceptual  aquivalent  of  a  stationary  source  in  free-field  listening 
conditions. 

In  the  psychological  fidelity  experiment,  localisation  accuracy  and 
response  time  were  measured  under  conditions  which  used  various  combinations 
of  the  three  components  of  the  SAL  simulation.  In  addition,  a  control 

condition  was  run  which  involved  no  simulation,  i.e. ,  this  condition  involved 
normal,  free-field  localization  of  a  single  (reel)  source.  This  allowed  an 
evaluation  of  the  overall  fldality  of  the  SAL  system  and  the  identification  of 
any  of  tha  three  major  components  of  the  simulation  responsible  for  departures 
from  complete  psychological  fidelity. 

A.S.2  Method 

A. 5.2.1  Experimental  Design 

Each  of  4  llstanars  wars,  tasted  in  six  conditions  (3  source  types  by  2 
listening  conditions)  repraaentad  in  Tabla  3.  A  white  nolee  stimulus  was 
presented  either  to  tha  unaided  ears  (normal,  free-field  conditions)  or 

dlchotlcally  vis  headphonas,  with  localization  cues  simulated  by  using  the 
manikin.  Thrat  types  of  sources  were  used:  real  sources  (at  10  degree 
intervale),  phantom  sources  at  2  degrees  from  the  nearest  real  source,  and 
phantom  sources  at  5  degrees  from  the  nearest  real  source. 

For  two  listeners,  the  head  remained  stationary  during  localization. 
Another  pair  of  lliteners  was  encouraged  to  move  their  heads  during 

localization  and,  in  tha  simulated  cues  condition,  the  sound  was  moved  about 
the  manikin  to  compensate  for  the  listener's  htad  movements. 

Tha  dependent  measures  ware  localization  accuracy  (apparent  source 
azimuth  minus  actual  source  azimuth)  and  response  time.  The  response  time  was 
measured  with  millisecond  accuracy,  from  tha  onset  of  the  sound  until  the 
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Mean  Localization  Error  in  the  Psychological  Fidelity 
Experiment  as  a  Function  of  Type  of  Source  and 
Head  and  Listening  Conditions 


Overall  source  Means  #  1.654  2.216  1-304  1.725 


subject  pressed  Che  consent  switch  to  indicate  that  he/she  was  pointing  at  the 
sound. 


4, 5.2. 2  Procedure 

In  the  fixed  head  condition,  the  listener  was  instructed  to  place  his  or 
her  chin  in  a  chin  rest  and  to  keep  Che  head  stationary.  The  chin  rest  was 
mounted  on  the  platform  which  held  the  listener's  chair.  In  both  head 
conditions,  the  center  of  the  subject's  head  wee  positioned  directly  under  the 
3-Space  Tracker  source.  The  chair  was  adjusted  so  that  the  listener  assumed  a 
comfortable  posture. 

In  the  unaided  (free-field)  listening  conditional  the  listener  sat  in  the 
center  of  the  loudspeaker  array  in  place  of  the  manikin.  The  listener  was 
seated  such  that  the  entrance  to  his  or  her  ear  canals  was  at  the  same  height 
as  the  center  of  the  loudspeaker  assemblies.  The  subject's  chair  was  adjusted 
so  that  the  center  of  the  head  was  at  the  canter  of  the  loudspeaker  arrpy.  In 
the  simulated  cuaa  (headphone)  conditions,  the  listener  was  seated  in  the 
ajolnlng  "subject"  room,  and  wore  the  helmet  with  the  headphones.  The  helmet 

strap  was  fastened  and  adjusted  so  chat  the  headphones  were  snug  against  the 

pinnae,  The  listener  was  positioned  such  that  his  or  her  head  was  in  the  same 
relative  position  occupied  by  the  manikin's  head  in  the  modal  room. 

In  all  testing,  the  listener's  chair  was  surrounded  by  a  circular,  opaque 
curtain  in  order  to  block  the  view  of  the  loudspeakers  end/or  cornera  of  the 
room.  The  listener  was  seated  at  the  center  of  the  circle  described  by  the 
curtain  which  had  a  radius  of  78  Inches.  The  room  Illumination  was  Just 
bright  enough  to  read  comfortably,  and  the  listener  was  not  blindfolded. 

In  both  experiments,  the  listener  indicated  Che  apparent  direction  of  the 
sound  by  pointing  the  arm  and  hand,  in  which  the  response  indicator  was 

held.  A  magnetic  sensor  mounted  on  the  Indicator  was  used  to  measure  Its 

position  in  space  as  described  in  Section  3.3,4,  The  listener  was  instructed 
to  point  the  indicator  in  the  apparent  direction  of  the  sound  as  accurately  as 
possible,  and  to  press  the  consent  switch,  also  mounted  on  the  response 
indicator,  while  holding  the  response  indicator  steady.  The  subject  was  also 
told  to  use  che  right  hand  for  sounds  on  the  right  side  and  the  left  hand  for 
sounds  on  Che  left,  and  to  hold  the  response  indicator  out  at  arm's  length. 
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On  each  test  trial,  a  sound  was  presented  at  one  of  24  different 
azimuths,  as  shown  In  Table  6*  Forty-eight  test  trials  were  run  in  each  of  10 
blocks,  with  brief  rest  periods  between  blocks.  Bach  of  the  24  azimuths  was 
presented  twice  in  each  block  In  one  of  five  pseudorandom  orders.  Five  blocks 
with  different  pseudorandom  orders  were  tested  in  the  first  test  session  for 
each  listener.  The  listener  returned  for  a  second  session,  in  which  the  same 
five  blocks  of  azimuths  were  run  in  a  different  random  order.  Each  listener 
was  run  in  one  practice/warm-up  block  of  10  test  trlalo  at  the  beginning  of 
each  session. 

Since  it  would  have  been  difficult  to  move  the  circular  curtain  from  room 
to  room  between  test  sessions,  listeners  were  tested  in  the  model  room  In  the 
first  session  and  in  the  subject  room  in  the  second  session.  Thus  all 
listeners  ran  in  the  unaided  (frea-fleld)  condition  in  the  first  session  and 
in  the  simulated  cue  (headphone)  condition  in  the  second  session.  Since  tha 
localization  task  is  simple,  there  should  be  no  additional  improvement  in 
performance  after  the  warm-up  trials,  and  therefore  no  differential  practice 
affect  for  Session  1  versus  Session  2.  If  this  assumption  is  correct, 
performance  in  the  unaided  condition  should  not  improve  over  test  blocks. 

All  subjects  were  given  a  typed  statement  which  described  the  purpose, 
nature,  risks,  and  benefits  of  the  experiment,  and  the  experimenter  also  read 
the  statement.  aloud  to  the  subject.  A  standard  audiometric  test  (ANSI  1969) 
was  then  administered.  Subjects  with  greater  than  30%  hearing  loss  were  not 
used  in  the  experiments.  The  30%  criterion  was  computed  by  taking  30%  of  tha 
difference  between  the  threshold  for  young  adults  with  normal  hearing  and  the 
threshold  for  feeling  for  each  frequency  tested  (cf.,  Slvlan  6  Whits,  1933; 
Bekesy,  1960). 

The  headphone  channels  were  bslanced  at  68  dBA  at  tha  beginning  of  each 
session  which  involved  dlchotic  presentation.  The  signal  laval  at  the 
manikin's  ears  (or  the  listener'a  ears  in  the  free-fleld  listening  conditions) 
was  set  to  54  dBA. 

Each  test  trial  began  with  the  Illumination  of  a  light  at  the  0  degree 
azimuth  position  at  the  height  of  the  subject'*  ears.  The  light  indicated 
that  the  system  was  ready,  and  the  subject  was  instructed  to  press  the  consent 
switch  when  he/she  was  ready  for  the  trial  to  begin.  The  subject  wae  alno 
Instructed  to  turn  hia/her  head  ao  that  he/ahe  was  looking  directly  at  the 
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TABLE  6 


Azimuths  at  which  Sounds 
were  Presented  in  the 
Psychological  Fidelity  Experiment 


Region 

(Degrees) 

Side 

Tvne  o*  Source 

Real 

2  Degree 
Phantom 

5  Degree 
Phantom 

0 

L 

0 

358 

355 

R 

0 

2 

5 

45/315 

L 

320 

318 

315 

R 

40 

42 

45 

90/270 

L 

2  70 

272 

275 

R 

90 

88 

85 

135/225 

L 

230 

228 

225 

R 

130 

132 

135 

ready  light.  The  system  then  sampled  the  subject's  head  azimuth  until  two 
successive  azimuths  within  +5  degrees  azimuth  were  obtained  at  least  0.5  sec 
apart  In  time.  The  light  remained  on  until  the  subject  met  the  head 
orientation  criterion. 

When  the  head  orientation  criterion  had  been  met,  the  response  timer  was 
started  and  the  sound  was  turned  on.  The  sound  remained  on  until  the  consent 
switch  was  pressed  again.  At  thee  time,  the  PC  automatically  read  the 
response  timer,  measured  the  position  of  the  response  indicator,  and  computed 
the  apparent  direction  of  the  sound.  The  sound  was  than  turnwd  off,  and  the 
apparent  sound  direction  and  th«j  raaponaa  time  were  written  to  disk.  Before 
the  next  trial  began,  an  intertrial  of  1.0  sac  was  imposed. 

4. 5.2.3  Subjects 

The  subjects  were  three  male  employees  of  the  Georgia  Tech  Research 
Institute  and  one  female  who  waa  not  an  amployee.  Subjects'  agas  ranged  from 
23  to  43  years.  Subjects  were  paid  for  thalr  participation. 

4.5.3  Results 

The  data  records  of  the  psychological  fidelity  experiment  Included  three 
numbers  of  Interest  for  each  test  trial:  (1)  the  accual  azimuth  at  which  the 
sound  was  presented,  (2)  the  perceived  (indicated)  azimuth  of  the  sound,  based 
on  the  subject's  pointing  response,  and  (3)  the  response  time.  Localization 
accuracy  was  computed  by  subtracting  the  actual  sound  azimuth  from  the 
Indicated  sound  azimuth.  If  the  difference  was  greater  than  180  degrees,  then 
380  degrees  was  subtracted  from  It  to  form  the  localization  accuracy.  If  the 
difference  was  less  than  -180  degrees,  360  degrees  was  added  to  form  the 
localization  accuracy.  These  adjustments  prevented  misinterpretation  of  test 
trials  on  which  the  actual  sound  azimuth  and  the  indicated  azimuth  were  on 
opposite  sides  of  the  discontinuity  at  360  degrees  azimuth. 

Examination  of  the  raw  data  and  debriefing  of  the  subjects  revealed  that 
various  conditions  rendered  the  data  erroneous  on  a  small  number  of  test 
trials.  The  first  condition  was  a  malfunction  in  the  re-inltialization  of  the 
3-Space  Tracker  Just  before  finger  position  was  sampled.  This  caused  exactly 
the  same  indicated  azimuth  to  be  recorded  on  two  successive  trials.  The 
second  of  the  two  trials  was  therefore  excluded  from  the  data  analysis  in  such 
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cas  A  second  condition  involved  the  subject  accidentally  pressing  the 
con  ut  switch  in  the  process  of  moving  his  or  her  arm  to  point  coward  the 
perceived  direction  of  the  sound.  This  condition  produced  large  localisation 
errors.  In  order  to  remove  as  many  of  such  cases  as  possible  from  the  data, 
test  trials  on  which  localization  error  exceeded  60  degrees  were  excluded  from 
the  data  analysis. 

On  a  few  trials,  the  subject  pressed  the  consent  switch  to  extinguish  the 
ready  light,  and  the  ready  light  did  not  go  out  immediately  (because  the 
subject  had  not  met  the  "nose-on  criterion").  The  fact  that  the  light  did  not 
go  out  caused  the  subject  to  believe  that  the  twitch  had  not  bean  pressed 
firmly  enough;  hence  the  subject  pressed  the  switch  a  second  time.  On  a  few 
trials,  the  nose-on  criterion  had  been  met  and  the  sound  had  come  on  by  the 
time  the  second  switch  closure  was  sensed.  This  produced  very  short  response 
tlmss  (0.1  to  0.3  sec.).  Examination  of  the  raw  data  showed  that  valid 
response  times  were  always  greater  than  0.7  sec.  Therefore,  test  trials  for 
which  reported  response  time  was  less  than  0.5  sec  were  excluded  from  the 
data. 

Table  7  shows  the  number  of  test  trials  remaining  after  trials  meeting 
any  one  or  more  of  the  above  criteria  were  excluded.  The  number  of  excluded 
trials  was  15  out  of  a  total  of  1920  observations,  l.e,,  less  than  1  percent 
of  the  trials.  In  this  table,  trials  on  which  the  sound  was  presented  at 
azimuths  of  355,  358,  and  360  degrees  have  been  pooled  with  trials  for  which 
the  sound  was  presented  at  0,  2,  and  5  degrees. 

Three  types  of  summary  statistics  were  computed  for  valid  trials:  (1) 
the  root-mean  square  (RMS)  localization  error,  (2)  the  arithmetic  mean 
localization  error,  and  (3)  the  mean  response  time.  Mean  localization  error 
reflects  the  direction  (left  vs.  right)  of  average  perceived  sound  azimuth 
relative  to  the  actual  sound  azimuth.  Table  5  shows  mean  error  as  a  function 
of  type  of  source  and  listening  conditions.  Mean  localization  errors  less 
than  zero  indicate  that  the  average  perceived  sound  azimuth  was  less  (to  the 
left  of)  the  actual  azimuth.  In  t  he  head  fixed  condition,  unaided 
localization  appears  to  have  been  biased  slightly  to  the  left,  while 
localization  via  headphones  (SAL  cues)  was  biased  to  the  right.  In  the  moving 
head  condition,  however,  unaided  localization  was  biased  more  to  the  right 
than  was  headphone  localization.  These  effects  probably  reflect  individual 
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differences  in  perception  end/or  response  strategy.  In  terms  of  absolute 
magnitude  of  bias,  the  simulated  cues  condition  with  head  movement  was 
comparable  to  the  unaided  condition  with  head  movements.  Note  that  mean 
localisation  error  is  virtually  identical  for  all  three  types  of  sources  (2 
degree  phantoms,  5  degree  phantoms,  and  real  sources). 

The  RMS  localisation  error  provides  a  measure  of  the  verlabllltv  or 
dispersion  of  percaivad  sound  directions  about  the  actual  sound  azimuths.  The 
RMS  error  reflects  two  sources  of  variability:  (1)  the  dispersion  of 
perceived  asimuths  about  the  maan  perceived  eslmuth,  and  (2)  tha  deviation  of 
the  mean  parcalved  asimuth  from  tha  actual  sound  asimuth. 

Table  8  shews  RMS  localisation  arrar  as  a  function  of  type  of  source  and 
listening  conditions.  For  thrae  of  the  four  subjects,  localisation  error  was 
slightly  lass  In  tha  unaldad  (normal)  listening  condition  than  in  the 
headphones  (simulated  cues)  condition.  For  the  fourth  subject  this  trend  wns 
reversed,  i.e.,  he  made  greater  localisation  errors  in  the  unaided  condition 
than  with  simulated  cues.  Excluding  the  fourth  subject  (for  reasons  explained 
below),  overall  localisation  accuracy  wee  slightly  better  in  the  unaided 
condition  (RMS  error  -  9.466  degrees)  than  with  simulated  cuss  (RMS  error  - 
10.721  degress). 

Table  8  also  shows  a  clear  diffarenca  in  RMS  localisation  error  for  the 
fixed  versus  moving  head  conditiono,  but  only  small  differences  as  a  function 
of  type  of  source.  Thus  2  degree  and  5  dsgrae  "phantom"  sources  were 
localised  nearly  as  accurately  as  "real"  sources. 

Tha  data  of  Table  8  are  also  suggestive  of  an  interaction  of  smell 
magnitude  between  head  condition  end  listening  condition.  Again  excluding 
subject  4,  tha  difference  in  RMS  error  for  unaided  versus  headphone  listening 
when  the  head  la  allowed  to  move  (2.283  degrees)  is  slightly  greater  than  that 
In  the  fixed  heed  condition  (0.741  degrees). 

Mean  response  time  is  shown  as  a  function  of  type  of  source  and  listening 
condition  in  Table  9.  As  was  the  case  with  RMS  error,  the  difference  between 
mean  response  times  for  unaided  versus  headphone  localization  la  relatively 
small  for  three  of  the  four  subjects.  For  subject  number  4,  response  tine  wan 
much  faster  in  the  unaided  condition  then  with  headphones  (1.7  sec  versus  5.1 
sec).  It  Is  notable  that  this  subject  produced  relatively  high  RMS 
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Overall  source  Beans  3.329  3.471  3.297  3.366 


localization  arror  in  this  condition,  suggesting  a  speed-accuracy  trade-off. 
Excluding  thla  subject's  data,  tha  haadphona  condition  produced  alightly 
graatar  raaponaa  tlmaa  overall  than  did  the  unaided  condition  (the  difference 
vaa  0.9  aac). 

With  the  exception  of  subject  number  4  in  the  unaided  condition,  there  la 
a  clear  difference  in  raaponaa  time  for  the  head  fixed  veraua  head  moving 
condition,  with  tha  latter  being  graatar.  4a  wee  the  caae  with  RMS  error, 
there  appeara  to  be  little  or  no  difference  in  raaponaa  time  ae  a  function  of 
type  of  aource  (phantom  veraua  real). 

Again,  the  data  auggeata  an  interaction  between  head  condition  and 
llatenlng  condition.  In  the  unaided  condition  (excluding  subject  4),  heed 
movement  lengthena  raaponaa  tlmaa  by  2.3  aec  relative  to  fixed  head 
locallxation.  In  the  headphone  condition,  however,  head  movement  lengthena 
reaponae  time  by  4.0  aec  relative  to  the  head  fixed  condition. 

Tablea  10  through  12  ahow  the  three  meaaurea  localisation  accuracy  and 
apeed  aa  a  function  of  the  actual  aitlmuth  of  the  sound  aource.  Note  in  Table 
10  for  subjects  l,  2,  and  4  that  mean  localisation  error  tends  to  be  negative 
for  sound  sources  located  on  the  subject's  right  side,  and  positive  for 
aourcea  on  the  subject's  left.  In  other  words,  these  subjects  tended  to 
exhibit  a  bias  (systematic  error)  toward  the  front  (0  azimuth)  position. 

Table  11  shows  RMS  localisation  error  aa  a  function  of  the  actual  aslmuth 
of  the  sound.  As  expected,  localization  error  Increases  with  angular  distance 
from  the  frontal  sagittal  plane  in  the  fixed  head  condition.  The  Increase  in 
error  with  Absolute  Azimuth  eppears  to  be  comparable  for  Che  unaided  and 
headphone  conditions.  For  the  head-fixed  condition,  the  average  difference  In 
RMS  error  from  the  +j35  degree  region  to  the  0  degree  region  was  17.0  degrees 
for  the  unaided  listening  veraua  16.0  degrees  for  headphone  listening. 

Head  movement  appeara  to  eliminate  the  increase  In  RMS  localization  error 
aa  a  function  of  absolute  azimuth  (see  the  lower  half  of  TAble  11).  However, 
another  effect  now  appears.  RMS  localization  error  la  generally  greater  on 
the  sublet's  left  than  on  the  right  side.  The  effect  Is  present  in  both  the 
unaided  and  headphone  listening  conditions.  A  comparison  of  Tables  10  and  11 
suggests  that  this  effect  Is  due  to  greater  systematic  biases  in  responding  to 
sounds  on  the  left.  In  other  words,  the  larger  RMS  errors  on  the  left  side 
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.741  4.804  4.663  3.567  4.622  4.680 


art  not  due  to  greater  variability  of  parcalvad  sound  aslmutha  on  tha  left , 
but  to  a  systematic  bias  to  perceive  sources  on  the  left  as  displaced  from 
their  actual  aslmuth. 

Mean  response  time  la  shown  as  a  function  of  aslmuth  In  Table  12.  The 
Increase  In  response  time  from  tha  0  degree  region  to  the  £135  degree  region 
was  greater  for  the  head  moving  condition  (1.2  sac)  than  for  the  fixed  haad 
condition  (0.6  sec).  There  also  appears  to  be  a  small  interaction  of 
listening  condition  by  aslmuth.  The  increase  from  center  to  extreme  aslmutha 
was  about  0.6  sec  for  headphone  listening,  versus  0.1  sec  for  unaided 
listening  (excluding  subject  4). 

Tables  13  and  14  show  RMS  error  and  responsa  time  for  the  10  blocks  test 
trials  distributed  over  two  days.  The  data  have  been  categorised  by  actual 
sound  a-lmuth;  those  in  front  (H5  degrees)  being  shown  separately  from 
lateral  aslmuths  (35  tn  135  and  225  to  275  degrees).  Since  sources  at  tha 
side  and  back  are  more  difficult  to  localise,  at  least  with  the  head  fixed,  it 
was  anticipated  that  they  would  exhibit  a  greater  practice  effect.  However, 
this  was  not  borne  out.  From  Table  13,  it  can  be  seen  that  there  is  little  or 
no  evidence  of  an  Improvement  In  localisation  accuracy  over  blocks  for  either 
the  frontal  or  the  lateral  aslmutha  In  the  unaided  listening  condition. 
However,  In  the  headphone  listening  condition  there  Is  a  tendency  for  RMS 
localization  error  to  decrease  over  blocks.  Excluding  subject  4,  the  overall 
mean  RMS  error  on  the  last  test  block  of  headphone  listening  (7.6  degrees)  la 
virtually  Identical  to  that  last  block  of  unaided  listening  (7.5  degrees). 
This  is  notable  because  the  headphone  condition  produced  slightly  greater  RMS 
error  than  unaided  listening  when  the  results  wete  averaged  over  test  blocks 
(see  earlier  discussion). 

It  Is  also  notable  that  RMS  error  on  the  first  block  of  headphone 
(simulated  cue)  localization  (block  6)  la  roughly  equal  to  the  RMS  error 
throughout  all  five  blocks  of  tne  unaided  condition  (blocke  l  to  5)  excluding 
subject  4.  These  data  suggest  that  thara  was  neither  a  positive  nor  a 
negative  transfer  effect  In  going  from  the  normal  localization  cuos  in  the 
unaided  condition  to  the  simulated  cues  in  the  headphone  condition. 

Table  14  shows  mean  response  time  for  the  10  tost  blocks  over  two  days  of 
practice.  There  la  a  tendency  for  respottae  time  to  deciaaae  over  teat  blocks 
In  both  the  unaided  and  headphono  conditions.  The  decrease  in  response  time 
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over  test  blocks  is  about  the  same  for  both  listening  conditions  and  for  both 
head  conditions.  On  the  last  test  block  of  each  day  (blocks  5  and  10)  mean 
response  time  was  roughly  1.0  sec  faster  in  the  unaided  condition  as  compared 
to  Che  headphone  condition. 

i 

4.5*4  Discussion  and  Conclusions 

The  results  make  possible  an  evaluation  of  the  three  principal  components 
of  the  SAL  facility,  as  discussed  earlier  (see  Section  4.5*1).  The  major 
results  are  summarised  in  Table  15. 

The  algorithm  for  simulating  sound  sources  at  azimuths  which  fall  between 
the  locations  of  the  loudspeakers  was  successful.  The  difference  in  RMS 
localisation  error  for  "phantom"  as  opposed  to  real  (actual)  sources  was  less 
than  0.5  degree.  In  addition,  RMS  localization  accuracy  was  comparable  for 
phantoms  at  2  degrees  end  at  5  degrees  from  the  nearest  real  source. 

The  SAL  equipment  for  synthesising  the  acoustic  cues  necessary  for 
localisation  was  a',  so  successful.  Whan  the  listener's  head  remained 
stationary,  localization  accuracy  with  simulated  cues  was  nearly  as  good  as  in 
normal,  unaided  listening  conditions.  With  the  listener's  head  fixed,  the  RMS 
localisation  arors  for  simulated  and  normal  cues  were  13.1  and  12.4  degrees, 
respectively.  The  differences  in  mean  reponse  time  for  these  two  conditions 
were  less  than  0.5  sac. 

The  third  major  component  of  the  SAL  system  was  the  method  for  moving  the 
sound  around  the  manikin  in  real  time  in  order  to  compensate  for  the 
listener's  heed  movements.  The  results  for  subject  3  in  Table  13  illustrate 
thla  effect.  In  the  first  test  block  with  simulated  cues,  RMS  error  was  7.1 
degrees,  versus  3.9  degrees  for  the  first  block  of  unaided  localization.  By 
the  fifth  test  block,  localisation  accuracy  was  virtually  Identical  for 
simulated  cues  (RMS  •  3.7  degrees)  and  normal  cues  (RMS  •  3.6  degrees). 

In  summary,  the  SAL  facility  produces  a  high  fidelity  simulation  of 
normal,  frea-field  localization.  The  initial  difference  in  localization 
accuracy  for  simulated  cues  delivered  via  headphones  versus  unaided 
localization  was  about  3  degrees  RMS.  This  difference  decreased  to  virtually 
zero  after  one  session  (5  blocks  of  48  test  trials).  The  overall  difference 
in  mean  response  time  with  simulated  as  opposed  to  normal  cues  was  1.0  sec, 
and  this  difference  was  still  present  after  one  session  of  practice. 
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TABLE  15 


Sunnary  of  Major  Ratulca  of  the  Psychological  Fldtllty  Experiment 


PHANTOM  vs.  REAL  SOURCES 

*  DIFFERENCE  IN  RMS  LOCALIZATION  ERROR  <0.5* 

*  NO  DIFFERENCES  FOR  2s  vs.  5*  PHANTOMS 

*  MEAN  RT's  VIRTUALLY  IDENTICAL 

UNAIDED  vs.  HEADPHONE  LISTENING  (WITH  HEAD  FIXED): 

*  DIFFERENCE  IN  RMS  LOCALIZATION  ERROR  *  0.7* 

*  DIFFERENCE  IN  MEAN  RT  <0.5  SEC 
UNAIDED  vs.  HEADPHONE  LISTENING  (HEAD  MOVING) 

*  RMS  LOCALIZATION  ERROR 


MEAN  RT 


BLOCK 

1: 

3.9*  vs.  7.1* 

BLOCK 

5: 

3.6*  vs.  3.7* 

BLOCK 

1: 

4.7  SEC  vs.  6.3 

BLOCK 

5: 

4.0  SEC  vs.  5.3 
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It  Is  likely  that  response  times  for  the  simulated  cues  condition  would 
have  been  shorter  If  localization  accuracy  had  been  stressed  less  heavily  in 
the  instructions.  That  is,  listeners  can  probably  trade  accuracy  for  speed. 

Further  research  is  needed  in  at  least  three  areas  related  to  SAL. 
First,  research  should  be  undertaken  to  optimize  the  fidelity  of  SAL  cues. 
The  results  of  the  present  research  suggest  that  the  real-time  alteration  of 
the  sound  with  head  movement  is  the  area  where  there  is  greatest  potential  for 
improvement.  Second,  further  study  is  needed  to  evaluate  the  effects  of 
practice  on  localization  performance  with  simulated  cues,  and  the  form  of  the 
speed-accuracy  tradeoff  in  localization.  For  example,  what  is  the  impact  on 
localization  accuracy  of  introducing  incentives  for  localizing  more  quickly? 
In  a  task  in  which  SAL  cues  are  used  to  direct  visual  attention,  what  is  the 
Impact  on  visual  search  time  of  speed  incentives? 

Third,  a  coordinated  program  of  psychoacoustic  research  and  engineering 
development  is  needed  to  develop  an  electronic  synthesiser  of  directional 
audio  signals  which  is  suitable  for  airborne  applications. 


S.O  EXPERIMENTAL  RESEARCH 


5.1  Introduction 

This  research  was  directed  toward  identifying  how  the  characteristics  of 
auditory  signals  affect  their  usefulness  as  directional  cues  in  the  cockpit. 
Two  experiments  were  conducted  in  which  listeners  localised  sounds  on  the 
basis  of  simulated  cues  delivered  via  headphones.  The  character  of  the 

stimulus  iri  the  temporal  and  frequency  domains  was  manipulated.  The  dependent 
measures  were  localisation  accuracy  and  response  time.  In  the  first 

experiment,  the  frequency  composition  of  the  etlmulus  was  varied  by  using 
high-pass  and  low-pass  noise  stimuli  with  various  frequency  cutoffs*  In 

Experiment  II,  Intermittent  noise  stimuli  were  used,  and  burst  duration,  rlso 

time,  and  duty  cycle  were  varied. 

Both  experiments  Involved  repeated  testing  of  experienced  subjects,  i.a., 
a  repeated  measures  design.  The  stimuli  wars  presented  dlchotlcally,  via 
headphones  with  the  localisation  cues  generated  by  the  manikin,  as  described 
in  Section  3.0.  Subjects  were  encouraged  to  move  their  heads  while  localising 
the  sounds,  and  the  direction  of  the  sound  rslatlve  to  the  manikin  was  changed 
in  real  time  and  coordinated  with  head  movement  in  order  to  make  the  sound 
appear  to  be  stationary  to  the  listener.  Subjects  indicated  the  apparent 
azimuth  of  the  sound  by  pointing  their  arm  and  hand  and  pressing  the  consent 
switch  on  the  reeponse  indicator,  as  described  in  Section  4.5.2  for  the 
psychological  fidelity  experiment.  As  in  the  previous  experiment,  the 

subjects  were  screened  for  participation  by  using  the  ANSI  1969  audiometrlc 

test.  Subjects  with  greater  than  30%  hearing  loss  or  abnormally  shaped  pinnae 
were  not  uaed. 

The  experimental  procedure  waa  the  aaua  aa  that  part  of  the  paychologlcal 
fidelity  experiment  in  which  the  aubject  received  simulated  cues  and  waa  free 
to  move  the  head. 

5.2  Experiment  I 

5.2.1  Introduction  and  Method 

The  purpose  of  the  first  experiment  was  to  determine  the  importance  of 

various  parts  of  the  audible  spectrin  for  localization  performance  with 
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simulated  cute.  The  stimuli  Included  white  noise  and  high-  and  low-  pass 
filtered  noise  signals  with  steep  skirts,  as  shown  In  Table  lb.  The  cutoff 

frequencies  of  the  stimuli  were  designed  to  pass  portions  of  the  spectrum 

associated  with  sons  localisation  cuss  while  rejecting  portions  of  the 
spectrum  associated  with  others.  Table  17  shows  the  stimuli  and  the 
associated  localisation  cues,  based  on  previous  research  (see  Section  2.0  for 
s  review  of  the  research). 

Each  of  four  subjects  ran  in  two  sessions,  in  each  of  which  they 
experienced  nine  blocks  of  test  trials.  Etch  block  consisted  of  40  test 
trials.  On  each  test  trial,  the  stimulus  was  presented  at  one  of  8  azimuths 
lti  the  horlsontal  plans:  20,  60,  100,  140,  220,  260,  300,  or  340  degr«ea. 

Each  of  the  8  aslmuths  was  presented  6  times  in  each  block,  in  one  of  three 

different  paaudorandom  orders.  Each  of  the  9  types  of  noise  stimuli  was 
presented  on  one  block  in  each  session,  with  the  order  of  presentation  of  the 
nolee  stimuli  randomised  over  sessions  end  subjacta. 

Each  sassion  laatad  approximately  two  hours.  Each  test  subject  received 
an  initial  block  of  10  practlcs/werim*up  trials  at  the  beginning  of  each 
aeualon.  A  whits  noise  stimulus  was  presented  on  practlca/warm-up  trials. 
Subjects  wars  given  a  2  to  3  minute  rest  period  between  blocks,  during  which 
time  they  could  remove  the  headphones  and  move  about  the  laboratory. 

Thresholds  were  measured  for  each  of  the  nine  stimuli  for  each  subject 
prior  to  tha  experiment  by  the  method  of  adjustment.  During  threshold 
measurment,  tha  stimuli  wars  presented  dichotically  with  tha  sound  at  0 
degress  aeimuth  relative  to  the  manikin,  and  the  sound  direction  was  not 
corrected  for  head  movement.  The  experiment  was  run  with  the  intensity  of 
each  atimulue  adjusted  to  33  dBA  scneatlon  level  for  each  subject. 

The  subjects  were  one  female  and  three  male  employees  of  the  Georgia  Tech 
Research  Inititute  who  were  paid  for  thair  participation.  They  ranged  in  age 
from  23  to  29  years. 

5.2.2  Results 

The  raw  data  for  Experiment  l  were  first  processed  as  discussed  In 
Section  4.3.3  to  correctly  interpret  cases  in  which  the  actual  sound  azimuth 
and  the  azimuth  indicated  by  the  subject  were  on  opposite  sides  of  the  360,0 
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TABLE  16 

Filtarad  Nolt*  Stimuli 
Used  in  Exparinant  I 


-3dBV 


Cut-off 

Slop#  of 

Typa  of 

f raquancy 

aklrt 

filtarlnt 

:  (dBV/Octava) 

High-paaa 

- -  !  - 

2 

78.3 

4 

69.6 

1 

-79.0 

2 

-72.8 

4 

-66. 1 

Low- past 

8 

-59.4 

12 

-26.8 

16 

-29.3 
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TABLE  17 


Stimuli  for  Experiment  L 

and  Localisation  Cu*a  Available 


typo  of 

Cut-Off 

Cuaa  Aval labia 

Hois  a 

rraquancy  (kite) 

iiD  IAD  torso  Pinna* 

Low-Fas* 

4 

✓ 

✓  / 

/  /  ✓ 

8 

✓  /  ✓  ✓ 

12 

✓  /  ✓  ✓ 

18 

✓  ✓  /  / 

Whit* 

✓  ✓  ✓  ✓ 

degree  discontinuity.  Th«  raw  data  wars  than  further  Ciltarad  to  ramova  taat 
trials  on  which  any  one  or  more  of  three  conditions  vaa  met.  The  three 
conditions  were  response  time  less  than  0.5  sac,  localization  error  greater 
than  60  degrees,  or  exactly  the  same  indicated  azimuth  reported  as  on  the 
InmiadiataLy  preceding  trial.  The  rationale  for  these  exclusions  is  discussed 
in  Section  4.3.3.  Forty-five  of  2,880  test  trials,  or  1.5  percent,  wars 

excluded  from  the  data. 

As  in  the  psychological  fidelity  experiment,  three  summary  measures  wars 
computed:  mean  localisation  error,  root  mean  square  (RMS)  localization  error 
about  the  actual  sound  azimuth,  and  mean  response  time.  Each  measure  was 
computed  for  each  of  576  cells  representing  all  possible  combinations  of  9 

stimulus  types,  8  azimuths,  2  days  of  practice,  and  4  subjects.  The  mean 

across  subjects  for  each  measure  was  then  computed  for  each  of  144  conditions 
ohown  in  Table  18.  Multiple,  planned  comparisons  analysis  of  variance  (ANOVA) 
was  used  to  test  hypotheses  concerning  the  RMS  error  and  response  time  data. 

The  planned  comparisons  wars  designed  to  contrast  localisation 

performance  for  conditions  in  which  different  localisation  cues  wars  avallabls 
(see  Table  17).  Additional  comparisons  wars  also  mads  to  exaolns  ths  effect 
of  stimulus  bandwidth,  practice  (Day  1  versus  Day  2),  type  of  nolsa  stimulus, 
and  source  azimuth.  The  complete  set  of  comparisons  is  shown  in  Table  19. 

The  RM8  localization  error  results  are  shown  in  Tables  18,  20,  and  21. 
The  overall  mean  RMS  error  la  remarkably  similar  for  the  nine  types  of  noise 
stimuli.  The  only  statistically  reliable  contrast  (F(l,3)  ■  10.64,  p  <  .05) 
wafl  that  between  white  noise  end  2  kHz  high-pass  noiee  (the  "ITD"  contrast). 
This  result  suggests  that  the  presence  of  low  frequencies  (<2  kHz),  which 
provide  lnteraurel  time  difference  (ITD)  cues,  facilitates  localisation  with 
simulated  cuee. 

Given  that  the  ITD  contrast  was  significant,  It  is  surprising  that  the 
"ITD  +  torso”  contrast  was  not  significant.  In  fact,  ehe  4  kHz  high-pass 
noise  produced  a  smaller  RMS  error  (i.a.,  more  accurate  localization)  than  did 
white  noise.  The  Inconsistency  of  these  two  contrasts  and  the  variability  in 
RMS  error  across  noise  types  suggests  that  the  significant  ITD  contrast  may  be 
spurious  (i.e.,  a  type  I  error). 
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RMS  Localization  Error  as  a  Function  of 
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Referring  again  eo  Table  20,  RMS  error  on  Day  2  Is  consistently  smaller 
than  .on  Day  l.  The  Day  effect  approached  significance  (F(l,3)  -  5.13,  p  - 
.1084).  This  result  Is  consistent  with  the  practice  effect  seen  In  the 
headphone  condition  of  the  psychological  fidelity  experiment.  Recall  that  RMS 
localisation  error  decreased  over  practice  blocks  for  the  headphone  (aimulatad 
cues)  condition.  The  feet  that  there  was  no  decrease  in  RMS  error  in  the 
unaided  listening  condition  suggeate  that  the  improvement  In  localisation 
accuracy  le  due  to  adaptation  to  the  aimulatad  cues  rather  than  Improvement  of 
the  pointing  response. 

Table  21  shows  RMS  error  as  a  function  of  Day  end  Aaimuth.  There  was  a 
significant  effect  of  the  Aaimuth-side  contrast  (F(l,3)  •  1.409,  p<.05), 
reflecting  greeter  error  in  localising  sounds  on  the  listener's  left.  Among 
the  simple  effect  contrasts,  there  wee  e  significant  Asimuth-side  affect  on 
Day  1  <F< 1,3)  ■  23.49,  p<  .03),  but  not  on  Day  2. 

Table  22  ehowe  mean  localisation  error  (systematic  bias)  as  e  function  of 
Day  end  Aslmuth.  These  date  suggest  that  the  Side  effect  noted  above  on  Day  l 
is  due  to  systematic  localisation  error.  For  sounds  at  asimuthe  of  220  to  340 
degrees  on  Day  l,  the  listeners  pointed  an  average  of  7  to  10  degrees  to  the 
left  of  the  actual  sound  position.  This  systematic  error  was  greatly  reduced 
on  the  second  day.  These  reeulte  suggest  that  aubjecta  may  have  bean  less 
accurate  In  pointing  with  their  left  hand  then  with  their  right.  Examination 
of  individual  subject's  date  showed  that  the  effect  wee  present  for  all  4 
subjects,  one  of  whom  was  left-hsndsd.  Recall  that  subjects  were  instructed 
to  point  with  the  left  hand  when  the  sound  occurred  on  the  left  side,  and  with 
the  right  hand  for  aounda  on  the  right  side. 

The  only  other  significant  comparisons  among  the  simple  effect*  were  the 
Asimuth-side  effect!  for  the  following  stimuli!  4  kHz,  low-pass  noise  (F(l,3) 
■  12.67,  p  <  .03);  16  kHs  low-pass  noise  (F(l,3)  ■  39.08,  p  <  .05);  white 
noiee  (F(l,3)  ■  18,32,  p  <  .03);  end  2  kHs  high-pass  nolsa  (F  •  31,4  5,  p< 
.03). 

Tables  23  and  24  show  mean  localisation  response  time  se  a  function  of 
Noiee  type,  Aslmuth,  and  Day.  Tha  only  significant  contrast  was  the  effect  of 
Day  (F(l,3)  •  18.27,  p  <  .03).  Mean  response  time  averaged  0.8  sec  greater  on 
Day  1  then  on  Day  2. 
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V 


Response  Tlse  as  a  Fanction  of  Stianlos 
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5.3  Experiment  II 


5.3.1  Introduction 

The  purpoaa  of  this  experiment  was  to  axamlna  the  effects  on  localization 
performance  of  rlee  time,  repetition  rate  (duty  cycle),  and  burst  duration  for 
trains  of  high-  and  loi^frequency  noise  bursts.  From  studies  of 
lateralization,  it  appears  that  the  importance  of  rise  time  ae  a  cue  depends 
on  signal  duration.  The  longer  a  signal  lasts,  the  more  ongoing  lnteraural 
time  and  amplitude  differences  appear  to  contribute  to  lateralization.  As  the 
signal  duration  approaches  300  msec,  rise  time  no  longer  has  an  effect  on  the 
letarallzatlon  just-noticeable  difference  (jnd)  (Mills,  1972;  Durlach  & 
Colburn,  1978). 

As  discussed  in  Section  2.0,  the  cues  for  horizontal  plana  localization 
differ  at  high  and  low  frequency.  At  low  frequency,  ongoing  ITD  is  the 
dominant  cue.  At  high  frequency  several  cuos  are  available,  including  IAD, 
transient  and  envelope  time  differences,  and  monaural  cues.  Hence,  it  was  of 
interest  to  study  the  trade-off  between  rise  time  and  burst  duration 
separately  at  high  and  low  frequency. 

The  repetition  rata  variable  is  of  interest  primarily  for  high  frsquancy 
stimuli.  Ae  noted  in  Section  2.0,  the  auditory  system  is  sensitive  to 
lnteraural  time  differences  associated  with  the  envelope  of  complex  high- 
frequency  sounds.  The  present  experiment  examined  the  effect  of  envelope 
frequency  (l.e.,  repetition  rate)  cn  localization  oerformance  at  both  high  and 
low  frequencies. 

A  four-factor,  completely  crossed  design  was  used.  The  four  factors 
included:  (1)  two  types  of  noise  bursts  (2  kHz  high-pass  and  1  kHz  low-pass), 
(2)  two  rise  times,  (3)  two  burst  durations,  and  (4)  two  duty  cycles.  The 
values  of  rise  time,  burst  duration,  and  duty  cycle  used  are  shown  In  Figure 
28.  The  combinations  of  duty  cycle  and  burst  duration  employed  resulted  In 
three  values  of  repetition  rate,  ae  shown  In  the  right-most  column  of  Figure 
28.  Notice  that  repetition  rate  Is  the  Interaction  of  duty  cycle  and  hurst 
duration.  The  design  of  this  experiment  allows  an  Independent  examination  of 
the  effects  of  duty  cycle,  burst  duration,  and  repetition  rate. 
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Figure  28.  Parameters  of  the  Intermittent  noise  stimuli  used  in  Experiment  II. 


As  in  Experiment  I,  there  mey  be  a  main  effect  of  noise  type,  reflecting 
the  efficiency  of  the  localization  cues  available  with  each  stimulus  (see 
Table  17).  Thera  should  also  be  an  effect  of  the  laterality  (l.a.,  absolute 
aelmuth)  of  the  sound  source. 

3.3.2  Method 

Bach  of  four  subjects  ran  in  two  sessions,  in  each  of  which  they 
experienced  16  blocks  of  teat  trials.  Bach  block  conaiated  of  30  test 
trials.  On  each  teat  trial,  the  stimulus  was  presented  at  one  of  6  apparent 
aslmuthsi  20,  60,  100,  260,  300  or  340  degrees.  Bach  of  the  6  aslmutha  was 
presented  5  times  in  each  block,  in  one  of  8  different  pseudorandom  orders. 
Bach  of  the  16  stimuli,  defined  by  combinations  of  Noise  type,  Biss  time,  Chity 
cycle,  and  Burst  duration,  was  presented  during  one  block  in  each  test 
session.  The  order  of  presentation  of  the  stimuli  was  randomised  over 
subjects  and  sessions* 

Bach  session  lasted  approximately  two  and  one-half  hours.  Bach  subject 
received  an  initial  block  of  10  practice /warm-up  trials  at  the  beginning  of 
each  session.  The  stimulus  for  the  practice  block  was  the  same  as  that  for 
the  first  block  of  the  session*  Subjects  were  given  a  2  to  3  minute  rest 
period  between  blocks,  during  which  time  they  could  remove  the  headphones  and 
move  about  the  room. 

Thresholds  were  determined  by  the  method  of  adjustment  f or  each  of  the  16 
stimuli  for  two  of  the  four  subjects*  Tor  these  two  subjects,  the  dlfferencee 
in  thresholds  ee  a  function  of  Rise  time,  Burst  duration,  and  Duty  cycle  wore 
qvlte  consistent,  as  shown  in  Teble  23.  Therefore,  for  the  remaining  two 
subjects,  the  thresholds  were  measured  for  only  four  of  the  16  stimuli,  and 
thresholds  for  the  remaining  12  stimuli  were  estimeted.  The  thresholds  for 
these  12  conditions  were  estimated  by  adding  the  "threshold  difference"  values 
shown  In  Table  23,  which  are  based  on  the  messured  thresholds  for  the  first 
two  subjects. 

Threshold  measurements  were  taken  prior  to  the  first  experimental 
session.  During  threshold  measurement,  the  stimuli  were  presented 
dichotlcslly  with  the  sound  at  0  degrees  azimuth  relative  to  the  manikin,  and 
the  sound  direction  wee  not  corrected  for  heed  movement.  The  experiment  was 
run  with  the  Intensity  of  eech  stimulus  adjusted  to  22  dBA  sensation  level  for 
each  subject. 
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Thresholds  for  Experiment  II  Stimuli 
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See  text  for  explanation 


The  subjects  were  three  male  and  one  female  employees  of  the  Georgia  Tech 
Research  Institute,  who  were  paid  for  their  participation.  The  subjects 
ranged  In  age  from  19  to  43  years. 

3.3.3  Results 

As  In  the  previous  experiments,  the  raw  data  wore  processed  and  some 
trials  were  excluded  to  remove  spurious  data  and  account  for  the  discontinuity 
at  360  degrees.  Ttreney  four  trials  of  the  total  3,840,  or  0.6  percent,  were 
excluded  from  the  analysis  reported  herein*  Msan  localisation  error,  RMS 
localisation  error,  and  mean  response  elms  wsrs  computed  for  each  subject  in 
each  of  96  call*  representing  combination*  of  16  stimulus  typsa  by  6  source 
•slmuthe. 

The  RMS  error  and  mean  response  time  data  ware  eubmltesd  to  a  rapsatad- 
maaauraa  ANOVA.  The  ANOVA  modal  included  the  main  affects  of  all  four 
stimulus  variables  (Noise  type,  Rise  elms,  Buret  duration,  and  Duty  cycle) , 
the  llneer  component  of  the  ealmuth  variable,  and  all  two  and  three-way 
interaction*  of  the  foregoing  yarlablea.  Th*  coefficients  for  the  Aaimuth- 

llnaar  contrast  were  +1,  0,  -1,  -1,  0,  and  +1  for  sourca  aslnuth  of  20,  60, 

100,  260,  300,  and  340  degress,  respectively. 

Table  26  show*  mean  localisation  orror  ss  a  function  of  tha  stimulus 

variables  and  aslmuth.  Most  of  the  means  era  quite  small  In  absolute  value, 
indicating  that  there  iaea  relatively  little  systematic  error  in 
localisation.  This  result  also  means  that  th*  RMS  error  measure  raf laces 

primarily  the  dispersion  of  indicated  aslmutha  about  tha  call  meana  rather 
than  the  deviation  of  th*  call  means  from  the  actual  source  aslmutha. 

Tha  ANOVA  for  RMS  error  produced  only  thrsa  significant  affects.  The 
linear  trend  with  absolute  sourca  aalmuth  was  statistically  reliable  (P(l,3)  - 
13.43,  p  <  .03).  Th*  last  row  in  Table  27  shows  that  RMS  localisation  error 
was  smallest  at  ±20  dagraas  and  lncraasad  monotonically  to  ±100  degrees 
aalmuth.  Even  though  th*  aslmuth  affect  is  significant,  the  effect  la  quite 
email.  Tha  RMS  error  increase*  an  average  of  only  about  1.0  degree  from  ±20 
to  ±100  degraaa  aalmuth. 

Tha  three-way  interaction  of  Burst  duration  by  Duty  cycle  by  Azlmuth- 
llneet  approached  significance  (F(l,3)  ■  6.28,  p  <  .10).  Table  28  ehowa  the 
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0.748  2.839  3.436  -0.613  -0.092  -0.855 


TABLE  27 


RMB  Localisation  Error  as  a  Function 
of  Bunt  Duration.  Duty  Cycla,  and 
Sourca  Asimuth  for  Exearlmant  II 


Bum 

duration 
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TABLE  28 

Linear  Component  of  the  Change 
in  RMS  Localisation  Error  with  Source 
Aslauth  as  a  Function  of  Duty  Cycle 
and  Surat  Duration 


Buret 

duration  Duty  cycle  (percent) 

(nsec) _  10  BO 


(2  Hi)*  (10  Hi) ! 

50  -2.055  -2.079 

(0.4  Hi)  (2  Hi) 

250  -3.526  -0.848 


Numbers  in  parentheses  are  the  repetition  rate 
for  each  eignal,  a.g.,  I  or  a  buret  duration  of 
50  nsec  and  10  paroant  duty  cycle,  the  burets 
occurred  two  tines  per  second. 
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linear  component  of  the  change  In  RMS  over  Lzlmuth,  as  a  function  of  Burst 
duration  and  Duty  cycle.  The  negative  scores  indicate  that  RMS  error  was 
greater  at  +100  degrees  asiauth  than  at  +20  degrees.  The  numbers  in 
parentheses  above  each  data  point  are  the  rate  of  repetition  of  the  bursts  for 
each  combination  of  Duty  cycle  by  lure?  duration.  The  lorn  repetition  rate 
(0*4  Ha)  may  have  been  responsible  for  the  greater  atlmuth  effect  in  the  250 
msec  burst,  10  parcant  duty  condition.  Subjacte  reported  that  they  kept  their 
head  oriantad  toward  0  dagraee  aslmuth  until  thay  haard  the  first  buret  on  e 
given  trial.  Sinea  localisation  accuracy  dacraasad  with  aslmuth  relative  to 
the  head,  a  first  burst  at  on  extreme  asiauth  providad  lass  information  than  a 
first  burst  at  a  small  absoluta  asiauth.  Xf  tha  subject  raapondad  befora  s 
sacond  burst  occurrsd,  localisation  should  bs  loss  accurst#  at  axtrsme 
aslmuths.  Tha  fact  that  assn  response  time  to  +100  dagraes  aslmuth  averaged 
only  0,8  sec  greater  than  to  ,+20  dagraas  asiauth  suggests  that  subjacte  often 
raapondad  during  tha  "off  part  of  tha  duty  cycla  (which  lasted  2.25  esc  in 
tha  2S0  burst,  10  parcant  duty  condition). 

Tha  only  other  affact  for  RMS  arror  which  approached  significance  was  the 
thraa-mroy  interaction  of  Rise  time  by  Burst  duration  by  Duty  cycle  (F(l,3)  • 
3.92,  p  <  .10).  Table  29  shows  that,  as  expected,  a  shorter  rise  time 
produces  greater  localisation  accuracy  (smaller  RK8  arror)  for  throe  of  four 
combinations  of  Duty  cycle  end  Burst  duration.  The  shorter  rise  time  was  most 
beneficial  in  tha  230  msec  buret,  10  percent  duty,  which  had  the  lowast  burst 
repetition  rate*  This  affact  is  consistent  with  tha  ganarsl  conclusion  from 
latarallsatlon  studios  that  a  short  rise  time  is  more  baneficlsl  whan  ongoing 
lntaraural  time  and  aaplituda  dlffarancas  era  laae  available  (due  to  the  low 
repetition  rate  In  this  case).  It  la  not  clear  why  the  shorter  rise  time 
failed  to  produce  a  beneficial  affact  in  tha  30  maec  burst,  10  parcant  duty 
condition.  Tha  main  affact  of  rite  tins  mss  In  the  anticipated  direction,  but 
was  not  statistically  reliable. 

4s  mentioned  earlier,  it  was  of  interest  to  study  the  trade-off  (in  terms 
of  localisation  accuracy)  between  rise  tins  and  burst  duration  separately  for 
hLgh  end  low  frequency  stimuli.  The  three  way  Interaction  of  Noise-type  by 
Rise  time  by  Burst  duration  for  RM8  error  did  not  approach  significance 
(F( 1 ,3)  ■  2.33,  p  ■  .2228).  Table  30  shows  tha  RMS  error  data  for  these  three 
variables. 
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TABLE  29 


RMS  Localization  Error  as  a  Function 
of  Rise  Time,  Burst  Duration 
and  Duty  Cycle 


Duty 

cycle 

(percent) 

Burst 

duration 

(nsec) 

Burst 

repetition 

rate 

(Hz) 

Rise 

1 

time 

20 

50 

2 

5.003 

4.734 

10 

250 

0.4 

4.491 

5.095 

50 

10 

4.512 

4.859 

50 

250 

2 

.  4.606 

4r908 

Means:  4.653  4.899 
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TABLE  30 

RMS  Localization  Error  as  a 
Function  of  Rise  Time,  Noise 
Type,  and  Burst  Duration 


Noise 

type 

Burst 

duration 

(msec) 

Rise  time 

1 

(msec) 

20 

Means 

50 

4.621 

4.968 

4.794 

1  kHz 
low-pass 

250 

4.787 

4.853 

4.820 

50 

4.894 

4.625 

4.760 

2  kHz 
high-pass 

250 

4.311 

5.150 

4.730 

It  was  alto  of  Intaraat  Co  determine  tha  effact  of  rapatltion  rata  on 
local Isat ion  accuracy  for  high  and  low  fraquancy  otimull.  Thara  ana  no 
algnificant  affact  of  rapatltion  rata  (Burst  duration  x  Duty  cycla)  overall 
(F(  1 ,3)  «  3.83,  p  -  .1433).  Furthanaorc,  thara  was  no  avldanca  of  any 
diffarantlal  affact  of  rapatltion  rata  fot  high  *  iriua  low  fraquancy  stimuli, 
l.a. ,  no  significant  lntaraction  of  Nolsa  typa  by  Burst  duration  by  Duty  cycla 
(7(1.3)  -  0.43,  p  «  .3599).  Tabla  31  shows  tha  RMS  arror  data  for  thaaa 
varlablas. 

Tha  ANOVA  for  maan  raaponsa  tins  producad  two  significant  affact*  and  ona 
marginally  significant  affact.  Tha  llnaar  trand  with  abaoluta  sourca  aslauth 
was  significant.  (7(1,3)  ■  10.26,  p  <  .03).  Tabla  32  shows  that  raaponsa 
tinea  wars  about  0.8  aac  fastar  on  tha  ivaraga  to  sourcaa  at  +20  dagraas  than 
at  +100  dagraas  aalmuth.  01 van  that  raaponsa  tlaas  wars  ganarally  long  (tha 
ovarall  naan  was  3.5  aac),  tha  dlffaranca  of  O.d  sac  to  localise  In  tha  front 
versus  cha  raarward  aaimuths  seaas  relatively  small.  The  minimum  raaponsa 
time  was  0.714  sac,  tha  maxima  was  7.SB9  aac,  and  tha  standard  deviation  was 
1.753  sac. 

Anothar  significant  dlffaranca  In  maan  response  time  was  for  Nolsa  type, 
aa  shown  In  Table  33.  Raaponsa  times  ware,  on  tha  average,  0.6  sac  fastar  for 
l  kHs  low-pass  nolsa  bursts  than  for  2  kHs  high-pass  nolsa  bursts  (7(1,3)  • 
10.43,  p  <  .03).  It  Is  notable  that  thara  was  a  dlffaranca  in  tha  same 
direction,  although  smaller,  for  naan  response  tlaas  in  Experiment  1.  In  that 
experiment,  the  elms  required  to  localise  a  continuous,  1  kHs  low-pass  noise 
was  about  0.2  sec  lass  than  chat  for  a  continuous,  2  kHs  high-pas*  nolsa. 

A  third  affact  for  mean  raaponsa  time  In  Experiment  II  approached 
significance.  This  was  the  lntaraction  of  Nolsa  typa  by  Duty  cycla  (F(l,3)  ■ 
7.63,  p  <  .10).  As  can  ba  seen  In  Tabla  33,  maan  raaponsa  time  dacraasad  as 
duty  cycle  increased  for  cha  low  fraquancy  noise  bursts,  but  increased  with 
duty  cycla  for  tha  high  fraquancy  nolsa  hursts. 

5.4  Discussion  and  Conclusions 

In  both  Experiments  l  and  II,  there  was  relatively  little  difference  in 
localisation  accuracy  as  a  function  of  stimulus  characteristics  in  tha  time 
and  frequency  domains.  Tables  34  and  35  summarize  the  major  results  of 
Experiments  I  and  II,  respectively.  On  the  basis  of  previous  findings,  it 
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TABLE  31 


RMS  Localization  Error 


Holaa 

as  a  Function 
Burat  Duration, 

Burat 

duration 

of  Noiae  Type, 

,  and  Duty  Cycle 

Duty  cycle  (percent) 

Gro« 

(Mac) 

—55 

(2  Hi)* 

<10  Ha) 

SO 

4,908 

4.680 

1  kSa 
lov-paa» 

2  SO 

<0.4  Ha) 

4.958 

<2  Ha) 

4.682 

so 

<2  Ha) 

4.829 

(10  Ha) 
4.691 

2  kHi 
hlgh-paaa 

2S0 

<0.4  Ha) 

4.628 

<2  Ha) 

4.832 

*Nuabara  in  paranthaaaa  arc  eha  burat  rapatitlon  rataa. 
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TABLE  32 


Mean  Response  Tin*  u  a  Function 
of  Source  Azimuth 

_ Source  azimuth  ( degreee  ) - 

20  60  100  260  300  340 


3.035  3.386  3.721  4.062  3.748  3.145 
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TABLE  33 


Mean  Reeponee  Tina  as  a  Function 
of  Rolan  Typa  and  Duty  Cycle 


Nolan 

WM 

(percent) 

IB 

io 

1  kit 

3.326 

3.094 

3.210 

lov-paaa 

2  kit 

3.684 

3.961 

3.822 

hl|b-p«aa 

4 

Maana 


3.303  3.327 
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TABLE  34 


SuMMry  of  Major  Results  of  Experiment  I 


*  RMS  localisation  Brror 

*  tfhita  no la*  prodoc *d  slightly,  but  significantly,  lass 
error  than  did  2  kite  high-pass  nolsa 

*  Localisation  tended  to  be  slightly  store  accurate  on  the 
second  day  of  practice  then  on  the  first  day 

*  Subject a  localised  source*  on  the  right  side  nor* 

accurately  than  sources  on  the  left  side  on  the  first  day, 
but  there  mss  no  difference  on  the  second  day 

Mean  Raaponae  Tie* 

*  Subjects  responded  significantly  faster  on  the  second  day 
of  practice  than  on  the  first  day 
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TABLE  35 


Sanaa  ry  of  Major  hesulte  of  Experiment  It 


Ml  Localisation  Error 

*  Thara  mi  a  Mall,  hot  significant,  linear  Increase  with 
source  atiaeth 

*  Tha  I  aaae  rlta  tiaa  produce*  laaa  arret  than  did  the  20 
Mae  rlaa  tiaa  for  signals  with  tha  lowest  tapatltion  rata 
<0.4  Ha) 

*  Tha  lncraaaa  in  arret  with  aiiaath  landed  te  be  greater  for 
stlaull  With  lev  rapatitian  rataa  than  fat  stlaull  with 
higher  rapatitian  rataa 

Man  Haaponaa  Tiaa 

'  thara  aaa  a  Mall,  but  ilgnif leant,  linear  lncraaaa  with 

source  asianth 

'  Tha  1  kfla  lev-pa**  nelaa  atiaull  produced  slightly  faatar 
localisation  than  did  tha  2  kite  high-pa**  nolaa  stimuli 
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would  be  expected  that  such  characteristics  would  exert  a  greater  influence  on 
locallaatloo  accuracy  (aaa  lac t ion  2.0  for  a  review).  Howavar,  almost  without 
exception,  tha  listener's  baad  was  raatralnad  In  the  pravloua 
investigations.  Tha  ralatlva  abaanca  of  affaeta  of  stimulus  eharactarlatlca 
In  tha  praaant  experiments  auggaata  that  hand  aovaaant  provldaa  localisation 
cuas  which  obvlata  tha  naad  for  multiple  cuas  prowldad  by  cartaln  daslrabla 
apactral  and  time-domain  eharactarlatlca  (aaa  lactlon  2.0  for  a  review  of  auch 
eharactarlatlca). 

Preedaan  and  Flahar  (1968)  reported  one  of  tha  few  pravloua  atudlea  which 
relate  tha  availability  of  apactral  euaa  to  localisation  accuracy  whan  tha 
head  la  free  to  nova.  They  atudied  localisation  accuracy  with  and  without 
head  aovaaant  and  with  tha  listener's  pinnae  open  or  occluded.  Whan  tha  head 
waa  fixed,  ocdualon  of  tha  pinnae  reduced  localisation  accuracy*  Whan  head 
aovaaant  was  allowed,  localisation  waa  as  accurate  with  occluded  pinnae  as 
with  open  pinnae.  Thus  head  aovaaant  appears  to  greatly  reduce  tha  lapact  of 
atlaulus  characteristics  on  localisation  accuracy,  both  with  olaulatad  cues 
and  In  normal  unaided  localisation* 

Although  tha  overall  affects  of  atlaulus  characteristics  on  localisation 
parforaanca  ware  saall,  soaa  trends  and  Interactions  wars  observed  which  have 
significance  for  tha  design  of  directional  auditory  dlaplays.  In  Experlaant 
II,  tha  overall  affects  of  tha  burst  duration,  duty  cycle,  and  rlsa  tlaa  of 
temporally  Intermittent  stlaull  on  localisation  accuracy  and  response  tlaa 
ware  not  statistically  significant.  However,  there  waa  a  trend  for  shorter 
rise  claas  to  produce  graatar  localisation  accuracy,  particularly  for  low 
raped tlon-rate  stlaull.  Thera  waa  also  a  differential  affect  of  repetition 
rata  as  a  function  of  source  aslauth.  Tha  decrease  in  localisation  accuracy 
with  absolute  aslauth  was  greatest  for  very  slow  repetition  rates,  probably 
because  tha  listener  did  not  welt  for  a  second  burst  before  responding  after 
turning  hla  head  in  tha  general  direction  of  the  sound,  as  explained  in 
Section  5.3.3. 

Varying  stimulus  characteristics  also  had  relatively  little  Impact  on  the 
tlaa  required  to  localize  a  sound.  There  were  no  significant  overall  effects 
of  high  or  low-pass  filtering  continuous  noise  stimuli,  and  no  overall  effects 
of  rise  time,  burst  duration,  or  duty  cycle  for  Intermittent  stimuli. 
However,  for  Intermittent  noise  stlaull,  low-pass  filtering  did  produce  faster 
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response  times  Chan  did  high-pase  filtering.  furthermore,  the  advantage  (In 
Canaa  of  shorter  response  times)  for  low-pass  stimuli  was  accentuated  at  high 
duty  cycles  (slow  repetition  rates). 

There  was  relatively  little  affect  of  absolute  source  azimuth  on 
localisation  accuracy  and  reaponse  tins  In  Experiments  I  and  II.  Fbr 

continuous  noise  stimuli,  there  was  no  effect  of  source  angular  distance  from 
the  frontal  sagittal  plane.  Fbr  intermittent  noise  stimuli,  there  was  a  small 
but  significant  effect  of  absolute  aslmuth,  localisation  error  Increasing 

about  1.0  degree  RMS  for  sources  at  +20  degrees  aslmuth  versus  sources  at  +100 
degrees  aslmuth.  The  decrease  in  localisation  accuracy  with  aslmuth  was 
greatest  (about  1,73  degrees  IMS)  for  vary  low  repetition  rate  stimuli.  There 
was  also  a  small  but  significant  increase  In  response  time  with  absolute 
aslmuth  for  Intermittent  noise  stimuli.  Mean  response  time  Increased  about 
0,8  sac  for  sources  at  +20  versus  +100  degrees  aslmuth. 

The  fast  that  absolute  aslmuth  has  only  small  effects  on  localisation 
accuracy  can  be  attributed  to  head  movements.  In  the  psychological  fidelity 
experiment,  the  head  fixed  condition  produced  an  average  difference  In 
localisation  error  of  16.6  dagraas  RMS  for  the  frontal  region  (+5  to  -3 

dagraaa)  versus  the  lateral  region  (+130  to  +133  degrees).  This  diffarsnee  is 
much  graatsr  than  that  obtained  whan  tha  head  was  frsa  to  move  (2.7 

dagraas).  On  tha  other  hand,  head  movement  does  not  seem  to  be  accountable 
for  the  Increase  In  response  time  with  abaolute  aslmuth.  Tha  fixed-head 
condition  of  the  psychological  fidelity  esper leant  produced  an  average 
Increase  in  mean  response  time  of  only  about  0.7  a«c  for  frontal  versus 
lateral  source  positions,  versus  1.0  esc  for  the  moving  head  condition. 


6.0  LONG-TERM  RESEARCH  AMD  DEVELOPMENT  FLAM 


This  section  Includes  a  long-term  plan  for  davaloping  an  alectronlc 
system  which  could  convey  accurate  directional  information  by  way  of  acouatlc 
signals  to  listeners  wearing  headphones.  The  system  will  have  the  capability 
to  Impress  directional  qualities  on  incoming  signals  and  massages  in  real 
time,  and  to  alter  the  directional  qualities  of  the  sound  in  real  time  with 
head  movement.  The  simulated  auditory  localisation  (SAL)  cuea  delivered  via 
headphones  should  enable  a  pilot  to  localise  sound  with  accuracy  comparable  to 
that  in  free-fleld  listening  conditions. 

6.1  Objectives 

The  overall  objectives  of  the  proposed  program  are  to  establish  technical 
requirements,  design,  build,  and  tast  an  alectronlc  system  for  impressing  a 
directional  quality  on  audio  signals  and  messagaa  in  the  cockpit.  The  system 
is  called  a  real-time  directional  synthesiser  (RTDS).  In  order  to  build  such 
a  system,  the  following  specific  objectives  must  be  accomplished! 

1.  Determine  the  minimus  rate  at  which  the  system  must  sample  head 
position  and  correct  the  apparent  direction  of  the  sound. 

2.  Determine  the  required  spatial  resolution  of  the  system. 

3.  Determine  the  required  bandwidth  and  dynamic  range  of  the  part  of 
the  system  that  synthesises  directional  signals. 

4.  Measure  and  digitize  transforms  that  relate  free-fleld  sound  from 

given  directions  to  the  aound  received  at  the  ear  canal  entrance. 
This  must  be  done  at  the  minimum  spatial  resolution. 

5.  Pit  the  directional  transforms,  in  either  the  time  or  frequency 

domains,  with  digital  filter  models. 

6.  Implement  the  models  on  a  computer  as  non-real  time  digital  filters 

and  use  the  filters  off-line  to  produce  directional  messages  and 

I 
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Test  Che  psychological  fidelity  of  alternative  filter  deals, ns  and 
select  the  bent  filter  models  for  further  research. 

Review  previous  research  to  determine  individual  differences  in 
head-related  transfer  functions  that  are  rc-la’ted  to  superior 
localisation  performance. 

Modify  the  digital  filter  models  to  Incorporate  features  associated 
with  superior  localisation  performance. 

Evaluate  localisation  performance  with  the  modified  filters  ss  a 
function  of  backgroend  no lea  level  with  the  listener's  head  fined. 

I 

Evaluate  the  psychological  fidelity  of  the  modified  filter  in 
producing  sound  movement  relative  to  the  head. 

It*  Specify  the  engineering  requirements  for  a  real-time  directional 
synthesiser  (ETD8)  and  prepare  a  plan  for  development  of  a 
prototypo. 

13.  Design  the  RTD8  hardware  end  software. 

14.  Fabricate  and  bench-teat  the  RTDS  prototype. 

13.  Teat  the  psychological  fidelity  of  the  RIDS  prototype. 

16.  Document  the  RTDS  hardware  end  software  and  recommend  any  needed 
refinements  to  the  design. 

The  proposed  work  builde  upon  the  previous  work  described  in  this  report, 
and  will  use  the  SAL  research  facility  described  in  Section  3.0. 

6.2  Approach 

In  this  section,  alternative  design  approaches  for  an  electronic  system 
for  producing  directional  audio  signals  in  the  cockpit  are  examined.  The 
cepabilltlee  and  characteristics  that  such  a  systam  must  have  include  the 
following: 


7. 

8. 

9, 

10. 

IU 
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1.  It  must  provide  directional  Information  In  a  manner  which  la  not 
workload  intensive.  Fbr  the  human,  turning  the  head  end  eyes  to 
visually  acquire  an  object  that  haa  produced  e  sound  ia  a  natural, 
almost  "automatic"  response  (cf.,  Posner,  1978;  Shiffrin  & 
Schneider,  1977). 

2.  It  must  accurately  reproduce  the  acoustic  cuaa  available  in  free- 
field  listening  for  binaural  headphone  presentation.  A  reasonable 
goal  is  to  enable  the  pilot  wearing  earphones  to  localise  sounds  and 
messages  with  accuracy  comparabla  to  that  with  the  unaided  ear  in  a 
natural  setting  not  characterised  by  high  reverberation  or  noise. 

3.  The  system,  in  conjunction  with  a  system  which  measures  head 
rotation  and  translation,  should  be  able  to  modify  the  directional 
cues  in  real  time  with  sufficient  speed  to  compensete  for  head 
movement,  aircraft  notion,  and  motion  of  tha  object,  If  any,  that 
the  sound  is  signalling. 

4.  The  system  should  minimise  front-back  reversals  in  localisation 
judgements  by  accurately  representing  the  scoustlc  cues  associated 
with  the  external  ears  (pinnae)  and  head  movement. 

5.  The  engineering  requirements  of  the  system,  in  terms  of  information 
storage  and  access  time,  must  be  within  the  bounds  of  existing  or 
near-term  technology,  and  be  such  that  the  system  is  suitable  for 
airborne  applications. 

6.  It  Is  highly  desirable  that  the  system  be  able  to  impress  a 
directional  quality  on  incoming  messages,  including  voice 
communications,  In  real  time.  That  is,  the  system  should  not  be 
limited  to  reproducing  prestored  sounds  and  messages. 

Two  extreme  approaches  in  terms  of  sophistication  for  building  the 
required  system  are  the  followings 


a.  Off-line  recording  for  laeer  playback  of  a  large  number  of  sounds 
and  messages  from  a  large  number  of  directions  relative  to  a  model 
of  the  human  head,  torso,  and  ears. 

b*  ■sal-time  synthesis  of  directional  cues  based  on  modeled  transforms 
for  e  large  number  of  directions  relative  to  the  head. 

The  first  approach  haa  the  advantage  that  It  Is  technologically 
simpler.  However,  it  is  llaltad  to  reproducing  preseored  sounds  and 
■sssagas.  It  also  hae  major  problems  in  tarns  of  the  speed  et  which  stored 
directional  Bounds  most  be  accessed  and  in  terms  of  the  emount  of  storege 
t squired. 

Aa  the  listener,  in  this  case  a  pilot,  turne  the  head,  the  directional 
sound  delivered  to  the  headset  must  be  changed  if  the  source  of  the  sound  Is 
to  appear  to  remain  stationary,  or  aova  in  a  manner  which  la  not  sieved  to 
head  motion.  A  human  can  easily  aova  the  head  at  a  rate  of  90  degrees/sac. 

In  order  for  the  sound  to  appear  to  change  smoothly,  l.e.,  not  Jump  back  and 

forth,  Its  directional  quantise  must  be  changed  before  the  listener's  head 
rotate*  through  an  angle  equal  to  th*  minimum  audible  angle  that  the  human  can 
discriminate.  The  minima  audible  angle  for  sounds  located  In  front  of  the 
listener  is  about  1  degree.  Hence,  the  directional  sounds  must  be  accessed  at 
a  rata  greater  then  90  He  for  sounds  In  this  region.  Of  course,  the  direction 
in  which  the  head  will  be  moved  cannot  be  predicted  In  advance.  The  system 

must  therefore  be  able  to  access  all  the  prerecorded  directions  for  a  given 

sound  with  the  seme  speed,  l.e.,  the  system  must  have  e  "random"  access 
capability. 

Tha  only  storage  medium  which  haa  the  required  random  access  capability 
and  speed  le  solid-state  random-access  memory  (RAM).  An  estimate  of  the 
amount  of  PAM  required  for  this  type  of  system  is  shown  In  Figure  29.  The 
estimate  of  the  number  of  directions  from  which  each  sound  must  ba  recorded  Is 
basad  on  the  audltcry  acuity  measured  in  previous  rsaearch  (see  Section  2.0 
for  a  review).  The  required  RAM  for  this  approach,  96,000  megabytes,  is  not 
feasible  for  airborne  applications  with  current  or  near-term  technology.  Even 
If  the  massage  length  and  number  of  directions  were  restricted,  the  system 
would  still  suffer  the  dleadventage  that  It  cannot  Impress  a  directional 
quality  on  incoming,  real-time  messages. 
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•  Number  of  bytes  of  RAM  required 
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Fiuure  29.  RAM  requirements  for  off-line  pre-recording  approach. 
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The  second  approach,  real-time  synthesis  of  directional  cues,  could  be 
Implemented  by  first  modeling,  off-line,  the  directional  transformation  that  a 
sound  undergoes  from  the  frse-field  to  the  ear  canal  entrance  for  each  of  a 
large  ntaaber  of  directions.  Incoming  messages  could  then  be  digitised  (if 
they're  not  already  in  digital  form)  and  digitally  filtered  using  the  modeled 
directional  transform  for  the  desired  direction  of  arrival*  Kather  than 
storing  directional  messages,  this  approach  would  require  storage  of  only  a 
small  set  of  coefficients  for  each  direction* 

The  real-time  operation  of  this  approach  is  outlined  in  Figure  10.  The 
system  would  perform  the  following  functions  in  real  timet  (1)  digitise  the 
incoming  message,  (2)  input  the  actual  or  disitsd  direction  of  arrival  of  the 
sound  relative  to  the  aircraft,  (3)  compute  the  direction  of  the  Sound 
relative  to  the  pilot's  head,  based  on  head  position  data  from  a  head-tracking 
system,  (4)  look  up  the  appropriate  filter  coefficients  in  RAM,  (5)  digitally 
filter  the  digitised,  incoming  message  in  aither  the  host  computer  or  in  an 
outboard  digital  processor,  and  (6)  convert  the  resulting  waveform  to  analog 
form,  amplify  and  loir-pass  filter  It  as  nscesssry,  and  output  to  the  pi  Lot's 
headphones* 

This  approach  has  three  major  advantages  relative  to  the  first 
approach*  First,  much  less  RAM  ts  required.  Figure  11  shows  the  estimated 
amount  of  RAM  needed.  Second,  access  time  to  waveforms  (in  this  case  stored 
as  model  coefficients)  is  not  a  problem*  Third,  the  nystem  is  not  limited  to 
pre-recorded  sounds. 

The  approach  also  has  two  disadvantages  relative  to  the  first  approach. 
The  first  is  that  a  large  number  ot  directional  transforms  must  be  modeled. 
This  is  not  a  serious  problem,  since  it  need  be  done  only  once,  off-line,  for 
ell  future  operatlona  of  the  system.  The  second  disadvantage  Ls  that  the 
approach  may  requira  a  host  computer  with  high  computational  speed  for  the 
digital  filtering  operation.  This  could  be  overcome  by  using  a  set  of 
special-purpose  outboard  microprocessors  which  perform  the  required 
multiplications  in  parallel. 

In  terms  of  both  capabilities  and  developmental  feasibility,  the  real¬ 
time  synthesis  of  directional  cuaa  teem  to  be  the  preferable  approach.  The 
proposed  research  and  development  therefore  addresses  this  approach. 
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Figure  30.  ConcepCual  diagram  ot  a  real-time  directional  synthesizer 
showing  the  digital  filtering  operation. 
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*•  Number  of  bytes  of  RAM  required  * 
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Figurt  31.  ram  requirements  for  real-time  synthesis  approach 
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6.3  Major  Technical  Issues 

6.3.1  System  Spatial  Resolution 

The  real-time  directional  synthesizer  must  have  sufficient  resolution  to 
make  a  sound  appear  to  cone  from  a  single,  stationary  point  in  apace,  even 
when  the  listener  moves  his  or  her  head.  As  the  head  rotates,  the  directional 
transform  applied  to  the  signal  must  be  altered  to  represent  the  new  direction 
of  the  sound  source  relative  to  the  head.  If  the  difference  between  the  new 
and  old  directions  of  the  sound  source  exceeds  the  minlmtn  audible  angle 
(MAA),  the  sound  will  appear  to  initially  neve  in  the  direction  opposite  head 
movement  and  then  Jump  in  the  direction  of  head  movement.  The  MAA  is  the 
smallest  angular  distance  which  allows  the  listener  to  discriminate  between 
tvx>  successive,  stationary  sounds  (cf«.  Mills,  1958).  If  the  angular  distance 
between  transforms  is  smaller  than  the  MAA,  then  as  the  head  moves,  the 
listener  should  not  detect  an  audible  transition  as  the  transform  is 

changed. 

Parrott  and  Musicant  (1977,  1981)  studied  the  MAA  with  moving  sound 

sources.  Listeners  were  asked  to  adjust  the  time  of  onset  for  a  moving  sound 

source,  such  that  the  onset  occurred  directly  in  front  of  them.  For  source 
velocities  from  45  to  17.0  degrees/sec,  the  MAA  measured  in  this  way  was 
comparable  in  magnitude  to  the  MAA  measured  by  Mills  (1958).  Thus  it  appears 
that  Che  MAA  la  a  valid  measure  of  auditory  acuity  for  moving  as  will  as 

static  sound  sources. 

although  many  previous  investigations  have  measured  auditory  acuity,  the 
methods,  stimuli,  and  source  directions  studied  have  differed  greatly.  As  a 
result,  our  knowledge  of  human  auditory  acuity  is  incomplete.  Many  of  the 
more  recent  studies  have  used  dlchotlc  presentation  and,  for  theoratical 
reasons,  have  us-d  unnatural  stimuli  which  would  never  be  encountered  in  free- 
fleld  conditions.  The  results  of  these  studies  are  therefore  not  useful  for 
determining  the  proper  angular  Interval  at  which  transform  must  be  measured 
for  the  real  Ime  synthesizer. 

The  most  comprehensive  study  of  froe-fleid  acuity  was  conducted  by  Mills 
(1958).  He  studied  the  MAA  for  pure  tones  as  a  function  of  azimuth  in  the 
horizontal  plane  of  the  interaural  axis.  Gardner  and  Gardner  (1973)  maaoured 
i-,c  a 1  1  zat  ion  accuracy  for  connected  speech  stimuli  in  the  median  sagittal 
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plane  and  for  ocher  vertical  planes  at  various  angles  to  the  median  plane. 
Wettschureck  (1973)  measured  the  MAA  for  white  noise  at  various  elevations  in 
Che  median  plane.  Harris  (1972)  replicated  Mills'  results  for  the  0*  azimuth 
region,  but  found  somewhat  smaller  MAA's  for  +30*  azimuth.  Searle,  Braids, 
Davis  and  Colburn  (1976)  developed  a  model  of  auditory  localization  based  on 
data  from  studies  of  horizontal  and  vertical  plane  localization  reported  over 
a  ten-year  period.  Their  model  predicts  the  standard  deviation  of 
localisation  error  (which  la  equivalent  to  the  MAA)  for  the  frontal  horizontal 
plane  and  the  median  plana  as  a  function  of  the  span  of  the  loudspeaker  array 
and  Che  acoustic  cues  available.  Their  model  is  actually  a  sophisticated 
summary  of  the  available  MAA  data.  If  MAA  data  were  available  for  other 
angular  regions,  the  model  could  be  extended  to  those  regions.  It  could  serve 
as  a  tool  for  estimating  the  MAA  between  maasurad  directions,  and  perhaps 
reduce  the  number  of  directions  for  which  data  muet  be  collected  In  this 
program. 

Although  previous  rsssarch  will  provide  a  useful  guide,  systematic 
measurements  of  the  MAA  era  needed  at  closely  spaced  Intervals  for  virtually 
all  combinations  of  azimuth  and  elevation.  From  previous  research  on  auditory 
localization  (see  Section  2.0  for  a  comprehensive  review)  it  appears  that 
localisation  accuracy  la  aa  great  or  greater  for  white  noise  than  for  any 
other  auditory  stimulus.  Vhlta  noise  la  therefore  the  logical  chol.ce  as  a 
standard  stimulus  for  the  quantification  of  the  MAA. 

Teak  1  of  Phase  II  consists  of  experimental  research  to  fill  the  gaps  in 
the  available  data  on  the  MAA.  The  MAA  will  be  measured  for  white  noise  at 
Incremsnts  of  azimuth  and  elsvation  over  the  entire  spherical  space 
surrounding  the  listener's  head.  The  MAA  should  govern  the  required 
resolution  of  the  synthesizer  when  the  movement  of  the  sound  source  relative 
to  the  head  is  slow.  When  the  movement  of  the  head  and/or  sound  Is  more 
rapid,  the  auditory  apparent  movement  illusion  may  provide  a  better  means  of 
producing  the  perception  of  smooth  movement  of  the  sound  relative  to  the 
heed.  The  exploitation  of  the  apparent  movement  phenomenon  is  discussed  In 
the  next  section. 


6.3.2  System  Response  Time 


In  order  to  be  useful  in  the  cockpit,  the  reel-time  synthesizer  must  be 
integrated  with  a  system  which:  (a)  measures  head  translation  and  rotation  at 
a  high  sampling  rate,  and  (b)  rapidly  computes  the  appropriate  sound  direction 
relative  to  the  head.  This  information  must  be  output  to  the  real-time 

synthesiser,  which  will  produce  a  sound  with  the  appropriate  directional 

qualities.  The  response  time  of  such  a  system,  from  the  pilot's  execution  of 
a  head  movement  to  the  production  of  the  sound  at  the  new  apparent  direction, 
must  be  short  enough  to  make  the  sound  appear  stationary  during  head  movement 
and  to  simulate  the  smooth  movement  of  sound  sources  about  the  head. 

The  simplest  algorithm  for  achieving  perceptually  smooth  movement  of  the 
sound  relative  to  the  head  would  be  to  successively  transition  between 
transforms  which  characterize  directions  differing  by  less  than  one  MM.  This 
approach  puts  very  heavy  demands  on  the  response  time  of  the  system.  A  human 

listener  can  easily  turn  his  or  her  head  at  a  rats  of  90* /sac.  For  sound 

sources  located  in  the  anterior  median  plane,  where  most  objects  of  lntaraet 
to  a  pilot  would  be,  the  HM  is  se  small  aa  1*  (Mills,  1958).  In  order  to 
keep  up  with  head  movemant,  the  system  would  need  to  transition  among 
transforms  and  produce  appropriate  sounds  at  a  rata  of  90  Hz. 

An  alternative  approach  to  minimizing  system  renponse  time  is  to 
synthesize  only  a  raduced  set  of  localization  cues  during  rapid  head 
movements.  Freedman  A  Plahar  (1963)  have  suggeeted  that  pinnae  cues  do  not 
contribute  to  localization  accuracy  during  head  movemsnis,  but  do  help  direct 
the  initial  orienting  response  (head  movemant)  to  a  sound.  Thay  maaeurad 
localization  accuracy  in  the  horizontal  plane  with  listeners  using  their  own 
pinnae,  with  their  pinnae  occluded,  and  with  artificial  pinnae,  with  or 
without  head  movement  allowed.  With  no  head  movement,  their  own  or  artificial 
pinnae  increased  localization  accuracy  considerably  relative  to  the  no-pinnae 
condition.  However,  when  head  movement  was  allowed,  there  was  no  significant 
difference  between  the  pinnae  and  no-ptnnae  conditions. 

This  result  suggests  that  the  modulation  of  binaural  cues  during  head 
movements  is  io  powerful  that  It  awamps  out  any  effect  of  monaural  cues.  If 
this  is  the  case,  then  it  should  be  possible  to  simulate  only  the  changing 
Interaural  time  difference  (ITD)  and  interaural  amplitude  difference  (IAD) 
during  rapid  head  movements  with  no  loss  of  localization  accuracy.  It  would 
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be  relatively  simple  to  vary  tha  ITD  of  the  signal  reaching  the  pilot's 

ha  ad  phones  as  a  function  of  head  orientation  In  real  tine.  Since  the  IAD 

increases  as  a  function  of  frequency,  it  would  be  necassary  to  high-pass 
filter  the  signal  In  a  direction-dependant  manner.  Such  a  filter  would  be 
much  simpler,  however,  then  a  filter  which  simulates  ell  the  localisation 

cues.  Also,  It  might  not  be  necessary  to  alter  the  characteristics  of  the  IAD 
high-pass  filtar  at  HAA-slsa  incrsments  as  the  sound  moves  relative  to  the 
head.  The  filter  could  be  changed  as  s  continuous  function  of  the  angle  of 
tha  sound  relative  to  tha  interaural  axis.  Reducing  the  cues  simulated  should 
greatly  reduce  the  amount  of  processing  needed  during  rapid  head  movements  and 
therefore  reduce  the  required  system  speed. 

Am  interesting  question  is  whether  the  "reduced  cue"  approach  could  be 
used  to  simulate  movement  of  e  sound  about  a  stationary  head.  The  acoustic 
information  racaivad  by  tha  ear  should  be  tha  same  regardless  of  whether  the 
head  moves  relative  to  the  sound  or  vice-versa.  It  is  possible,  but  not 

likely,  that  proprioceptive  feedback  from  the  neck  muscles  changes  the  way  in 
which  auditory  Information  is  processed  centrally,  l.a.,  somehow  promotes  ths 
use  ef  pinnae  cuss.  Even  If  this  ware  the  case,  tha  reduced  cue  approach 
would  still  be  useful.  In  the  present  application,  the  maximum  angular  rate 
of  haad  movement  is  much  greeter  then  the  angular  rate  at  which  an  object 
would  be  expected  to  move  about  tha  pilot  end  aircraft.  If  necessary,  the  two 
types  of  movement  could  be  simulated  by  different  methods.  When  the  head  Is 
moving  rapidly  one  could  use  the  reduced  cue  approach;  when  the  haad  la 
stationery  or  slowly  moving,  one  could  successively  transition  among 
transforms  associated  with  directions  differing  by  loss  then  one  MAA. 

A  third  possible  strategy  is  Co  exploit  the  sppsrsnt  movement  (beta) 
phenomenon.  Taking  advantage  of  the  apparent  movement  phenomenon  would  make 
it  possible  to  synthesise  rapid  movements  of  s  sound  relative  to  the  head  by 
successively  activating  transforms  which  era  ss par scad  by  more  than  one  MAA. 

When  two  spatially  separated  stimuli  srs  presented  in  succession,  the 
observer  may  perceive  not  two  stimuli,  but  s  single  stimulus  which  moves  from 
ths  position  of  tha  stimulus  which  appeared  first  to  the  posit  lot)  of  the 
second  stimulus.  In  order  to  obtain  apparent  motion,  the  interval  between  the 
onsets  of  trie  two  stimuli,  the  distance  between  the  stimuli,  and  the  Intensity 
of  tha  stimuli  must  be  in  the  proper  proportion.  The  phenomenon  also  depends 
on  the  duration  of  the  stimuli. 
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Wertheimer  (1912)  and  Korte  (1915)  reported  extensive  studies  of  apparent 
movement  in  the  visual  modality  (cited  In  Murch,  1973).  According  to  Korte, 
as  the  distance  between  the  stimuli  Is  Increased,  the  lnterstinulus  onset 
Interval  (ISOI)  must  be  Increased  proportionately  in  order  to  maintain  the 
apparent  movement.  When  the  observer  perceives  one  object  moving  smoothly 
from  the  position  of  the  first  stimulus  to  the  second,  Che  phenonanon  is 
called  "beta"  movement.  As  the  1301  la  lengthened,  a  point  is  reached  at 
which  the  movement  no  longer  appears  smooth,  but  broken  or  jerky.  The  broken 
movement  Is  called  the  "phi"  phenomenon.  As  the  ISOI  la  lengthened  further, 
the  observer  eventually  perceives  tuo  spatially  separated  stimuli  appearing 
successively.  Visual  apparent  movement  plays  an  important  role  in  virtually 
all  types  of  visual  simulation.  Including  such  mundane  applications  as  motion 
pictures. 

The  apparent  movement  phenomenon  also  occurs  in  the  auditory  modality 
(Burtt,  1917;  Msthlaaen,  1931;  Briggs  &  Parrott,  1972;  Geldard,  1984).  Bach 
of  these  studies  confirmed  the  ganaral  relationships  described  by  Korte  among 
the  ISOI,  the  distance  separating  the  stimuli,  the  Intensity  of  the  stimuli, 
and  the  stimulus  duration.  However,  Che  optimal  ISOI  for  the  auditory 
modality  (25  to  30  meec  for  stimuli  10  to  60  centimeters  apart)  appears  to  be 
shorter  than  the  optimal  ISOI  for  visual  stimuli  (about  75  to  125  msec  for 
stimuli  a  few  centimeters  apart). 

Among  the  studies  employing  acoustic  stimuli,  the  ratio  of  the  distance 
between  the  stimuli  to  the  ISOI  differed  greatly.  This  ratio  is  important  In 
the  present  application  because  It  Is  related  to  the  rate  at  which  the  sound 
moves  relative  to  the  head,  which  must  be  precisely  controlled.  The 
differences  among  the  studies  are  probably  attributable  to  differences  In  the 
methods  and  stimuli  used.  In  Che  earlier  studios  the  equipment  was  quits 
crude  by  modern  standards. 

Further  research  la  needed  to  determine  how  to  take  advantage  of  the 
apparent  movement  phenomenon  In  the  present  application.  Questions  of 
Interest  Include  the  effect  of  the  type  of  stimulus,  the  ungluar  distance  over 
which  apparent  movement  can  be  achieved,  the  rates  of  movement  which  are 
possible,  how  the  foregoing  variables  differ  as  a  function  of  the  angular 
region  In  which  the  movement  occurs  (e.g..  In  front  vs.  in  the  back  of  the 
head),  and  Individual  differences  In  Che  optimal  conditions  for  apparent 
auditory  movement. 
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The  proposed  research  is  organized  Into  chree  phases.  The  overall  goal 
of  Che  first  phase  is  to  determine  the  feasibility  of  the  digital-filter 
approach  to  developing  e  real-time  directional  synthesiser.  This  will  be 
accomplished  by  providing  preliminary  answers  to  five  major  technical  Issues. 
The  issues  Include:  (1)  the  einimua  system  response  time  required,  (2)  the 
spatial  resolution  required,  (?)  the  required  bandwidth  of  the  synthesizer, 
(4)  the  required  dynamic  range  of  the  synthesiser,  and  (5)  individual 
differences  in  head-related  transfer  functions  (HRTF'a)  which  are  correlated 
with  superior  locallsetlon  performance,  and  therefore  might  be  used  to  enhance 
performance  for  all  listeners*  It  la  anticipated  that  the  third  and  fourth 
laauas  can  be  answered  satisfactorily  by  reviewing  previous  research 
findings*  The  remaining  issues  will  require  additional  experimental 
research*  Phase  l  will  include  the  reviews  of  previous  research  for  all  five 
Issues  and  preliminary  experimental  research  on  the  first  issue. 

The  overall  goal  of  Phase  II  la  to  determine  the  engineering  requirements 
for  the  real-time  directional  synthesiser.  The  first  task  will  examine  the 
spatial  resolution  required  in  such  a  synthesiser  in  order  to  maintain 
psychological  fidelity  during  head  movements  and  preserve  head-movement 
related  cues  to  the  location  of  the  sound  source.  Also  in  this  phase,  the 
time-domain  waveforms  containing  directional  information  will  be  measured  and 
digitised.  This  data  will  then  be  uaad  to  evaluate  alternative  digital  filter 
models.  Next,  the  best-fitting  digital  filter  models  will  be  implemented  on  a 
computer  ao  that  ttv?y  can  be  evaluated  In  nonreal  time.  A  small  number  of 
meaaages  will  be  digitised  and  convolved  with  the  Alternative  nonreal  time 
digital  filters  to  produce  messages  which  have  a  directional  quality.  These 
directional  messages  will  then  be  used  in  a  variety  of  experimental  tests. 
One  sst  of  tests  will  examine  the  psychological  fidelity  of  alternative  filter 
models  In  taros  of  localisation  accuracy  and  speed.  The  results  of  these 
teats  will  be  used  to  select  one  or  two  filter  modeLs  for  further  experimental 
testing. 

A  second  set  of  experiments  will  examine  the  utility  of  emphanlzing 
selected  features  of  the  HKTF's  in  terms  of  localization  accuracy  and  speed. 
Individual  differences  in  HRTF's  that  are  associated  with  superior 
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localization  performance  will  b«  emphasized  by  altering  the  digital  filter. 
These  modified  HRTF's  will  be  tested  with  human  listeners  to  determine  which 
features  should  be  retained  in  Che  filter  model.  The  results  of  these  two 
sets  of  experiments  relate  directly  to  the  engineering  requirements  for 
maximizing  the  static  fidelity  of  the  synthesizer.  That  is.  for  maximizing 
localization  performance  when  both  the  head  and  sound  source  are  fixed. 

A  third  set  of  experiments  will  examine  the  psychological  fidelity  of  the 
system  when  the  sound  source  moves  with  respect  to  the  head  (l.e. ,  in 
conditions  of  head  movement  relative  to  a  stationary  sound,  or  sound  movement 
about  a  stationary  head,  or  a  combination  of  the  two). 

At  the  end  of  Phase  II,  the  major  design  requirements  for  the  real-time 
directional  synthesizer  will  have  been  determined.  The  last  major  task  in 
Phase  II  will  be  to  summarize  these  requirements,  including  recommendations 
for  any  further  research  needed,  and  to  develop  alternative  design  approaches 
for  building  a  real-time  directional  synthesizer. 

In  the  third  phase  of  the  proposed  program,  a  design  will  be  developed 
for  the  real-time  directional  synthesiser.  A  prototype  will  be  built  and 
bench-ceated.  This  phase  will  also  Involve  the  measurement  of  a  large  number 
of  directional  waveforms.  A  series  of  experimental  tests  will  be  conducted  in 
Phase  III  to  evaluate  the  psychological  fidelity  of  the  prototype  synthesizer 
and  to  optimize  the  synthesizer's  performance.  Recommendations  will  be  made 
for  any  needed  refinements  to  the  synthesizer  design  and  for  any  necessary 
further  testing  of  the  synthesizer. 
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7.0  APPMDIX 


AJWOTATBD  bibliocsafht 


Article*  were  selected  for  annotation  based  upon  chair  relationship  to 
the  goals  of  cha  literature  review.  In  this  regard,  Che  nejorlcy  of  articles 
Included  ooncencreCed  on  auditory  localisation  models  and  literature  reviews, 
horlsontal  and  median-plane  localization  cues,  audlcory  distance  cues,  effects 
of  head  movement  and  vision  on  localization,  perception  of  auditory  motion, 
and  blsensory  Interaction.  A  small  sample  of  articles  Involved  noise  effects, 
alternate  environments ,  auditory  displays,  stereophonic  listening  principles, 
and  acoustic  manikin  specifications.  A  topical  index  which  cross-references 
the  ennotations  according  to  the  above  categories  la  included  at  the  end  of 
this  appendix. 
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J.  Alekseenko,  N.  Yu.  (1983).  Role  of  movements  of  different  types  in 

spatial  hearing  acuity.  Human  Physiology.  240-243. 

The  acuity  of  spatial  hearing  (l.a.,  the  ainleai  audible  angle)  Was 
evaluted  with  respect  to  spatial  location  (0*  vs.  30*  aalmuth),  type 
of  response  (verbal  vs.  pointing),  and  head  orientation  (facing 
forward  or  facing  to  the  side).  For  verbal  responses,  performance 
was  better  at  0*  than  at  30*  aiimuth.  Pointing  improved  performance 
at  30*.  Head  orientation  had  no  effect. 

2.  Andreaasi,  J.  L. ,  &  Greco,  J.  R.  (1973).  Effects  of  blaensory 

stimulation  on  reaction  time  and  the  evoked  cortical  potential. 
Physiological  Psychology.  139-194. 

This  study  coapared  blaensory  stimulation  to  unlaaneory  (visual  or 
auditory)  stimulation,  using  both  reaction  time  (RT)  and  evoked 
potential  (EP)  as  dependent  variables.  Two  methods  were  used  to 
determine  the  temporal  offset  between  the  two  stimuli  in  the 
blaensory  condition.  The  first,  &R,  set  the  offset  equal  to  the 
difference  between  the  visual  and  auditory  RTe.  The 

second,  An2,  sat  the  of  fast  equal  to  the  difference  between  the 
visual  and  auditory  N2  latency  component  of  the  EP  meaaure.  The 
results  indicated  that  visual  RT  was  faster  than  auditory  RT,  and 
blaensory  stimulation  was  faster  than  the  visual  response.  The  EP 
measure  reaulted  in  shorter  latencies  In  audition,  and  the  amplitude 
of  blaensory  stimulation  was  higher  than  in  unimodal  stimulation. 
It  was  concluded  that  the  Increased  amplitude  of  the  blaensory 
stimulation  condition  resulted  in  facilitation  of  response  time* 

3.  Batteau.  D.  W.  (1967).  The  role  of  the  pinna  in  human  localisation. 

Proceeding  of  the  Royal  Society,  London.  Series  B.  163.  (pp.  138- 
180).  London,  England. 

A  theoretical  analysis  on  the  role  of  the  pinna  in  sound 
localization  was  presented.  The  pinna  introduces  quantitatively 
different  time  delays  on  incoming  signals  dependent  upon  the 
location  of  the  source.  These  transformations,  It  la  proposed,  ere 
used  by  the  auditory  system  to  recreate,  via  an  Inverse  transform  of 
the  time  delays,  tha  location  of  the  sound.  Tha  mathareatlcal 
procaaa  of  encoding  these  time  delaye  was  shown  to  ba 
physiological ly  possible,  basad  upon  a  fairly  almpla  naural  nat  of 
excitation  and  inhibition. 

4.  Batteau.  D.  W. ,  Plante,  R.  L. ,  Spencer,  R.  H. ,  &  Lyle,  W.  E.  (1963). 

Localization  of  aounds  Part  3.  A  new  theory  of  human  audition. 
(Ra port  No.  TP3109,  Part  i!).  china  Lake,  CA:  U.  S.  Naval  Ordlnanca 
Test  3tatlon. 

A  theoretical  analysis  of  sound  localization,  baaed  upon  tha  role  of 
the  pinna  as  a  means  of  Introducing  time  delays  on  Incident  auditory 
stimulation,  was  presented.  Time  delay  transformations  enable  the 
auditory  system,  using  information  theory  metrics,  to  perform 
monaural  autocorrelations  and  binaural  crosacorrelatlont  to  encode 
the  signal  source.  The  advantages  of  a  time  domain  theory  over  the 


183 


classical  fraquancy/ intensity  theory  Include  mechanisms  to  account 
for  monaural  localisation,  selective  attention,  localization  without 
head  movements,  and  masking  phenomenon  The  report  emphasized  the 
Importance  of  high  fidelity  equipment  and  the  acouatic  properties  of 
materials  in  recording  time  delay  and  developing  artificial  pinnae. 
Empirical  evidence  found  time  delay  differences  with  changes  in 
sound  source  aslmuth  and  elevation,  and  applied  theory  successfully 
by  testing  underwater  localisation  ability. 

3.  Batteau.  D.  W. ,  Plante,  R.  L. ,  Spencer,  R*  H. ,  &  Lyle,  W.  E.  (1963). 

Localisation  of  sound>  Part  3.  Auditory  perception.  (Report 
No.  TP31Q9,  Part  5).  China  Lake,  CAt  U.  S.  Naval  Ordinance  Test 
Station. 

The  basilar  membrane  la  implicated,  and  mathematically  modeled,  as 
the  structure  responsible  for  the  encoding  of  time  delays  Imposed 
through  pinna  reflections  on  auditory  stimuli.  This  refinement  in 
dairy  theory  resulted  in  the  Investigation  of  human  speech 
recognition  factors,  and  selective  attention.  The  development  of 
effective  electrostatic  headphones  for  use  in  accurately 
transmitting  localisation  information  mas  presented,  and  performance 
specifications  givan. 

6.  Biguar,  B»,  Jaannarod,  M. ,  &  Prablac,  C.  (1982).  Tha  coordination  of 

aya,  ha ad,  and  arm  movement a  during  reaching  et  a  single  visual 
target.  Experimental  Brain  Research.  46.  301*304. 

The  latency  of  eye,  heed,  end  arm  movements  directed  at  tha  earns 
visual  target  were  measured  in  five  human  subjects.  Latency  of 
activation  of  the  correa ponding  nack  and  arm  muscles  was  also 
measured.  Although  the  overt  movements  occurad  sequentially  (eye 
movement,  heed  eovement,  then  arm  covenant),  BMC.  discharges  wsre 
synchronous  with  respect  to  eye  movement  onset.  The  authors  suggest 
that  the  relative  synchrony  of  neural  commands  for  eye,  head,  and 
arm  movements,  in  producing  tha  pattarnad  aaquence  of  overt 
movements,  may  have  Implications  for  aye-hand  coordination  (i.e., 
subjects  mads  largar  pointing  arrors  when  not  allowed  to  move  their 
head  and  aye a  toward  the  targat). 

7.  Blauert,  J.  (1969/1970).  Sound  localization  in  the  median  plane. 

Acouatica.  22.  205-213. 

The  thaory  of  timbre  differences  was  appllad  to  auditory 
localisation  in  tha  median  plane.  The  thaory  propoaad  that  linear 
distortions  in  tha  fraquency  spectra,  cauasrf  by  pinna,  head,  and  ear 
canal  reflactlons,  changed  dependent  upon  the  angle  of  Incidence  of 
the  sound  sourct.  In  ths  first  experiment,  listeners  reported 
whether  e  source  appeared  above,  in  front,  or  behind  them.  The 
results  showed  that  localization  was  a  function  of  frequency.  Low 
frequencies  (<  630  Hs)  and  frequencies  between  2300  and  6300  Hz  were 
perceived  as  coming  from  the  front,  frequencies  between  800  and  2000 
Hs  end  between  8000  and  12000  Hs  were  pttreeived  as  behind  the 
subject,  and  frequencies  between  6300  and  8000  Hz  were  pttreeived  as 
above  tha  listeners.  A  second  experiment  measured  the  frequency 


184 


Afljuuwum  H.UL.1 4  U  >|J 


components  of  sound  st  the  sard rum,  and  confirmed  that  the  frequency 
changes  were  indeed  correlated  with  the  perceived  location.  In  the 
third  experiment,  recordings  from  the  ear  canal  were  played  back  to 
the  listeners  and  ware  perceived  as  in  experiment  1.  These  studies 
were  taken  as  support  for  the  directional  band  theory  of  median 
plana  localisation. 


8. 


Blauert,  J.  (1971).  Localization  and  the  law  of  the  first  wavefront  in 
the  median  plane.  The  Journal  of  the  Acoustical  Society  of  America. 
50,  466-470.  '  — ' 


This  study  investigated  and  evaluated  the  precedence  effect 
according  to  the  directional  band  theory  of  localisation.  The 
precedence  effect  is  the  suppression  of  another  sound  when  It 
follows  a  primary  sound  between  600  Msec  to  50  msec.  Below  600  Msec 
the  second  sound  Is  suaaed  with  the  first  to  erects  phantom  sound 
sources;  above  50  msec  the  second  sound  Is  perceived  as  an  echo  of 
the  first.  Using  the  method  of  constant  stimuli,  Blauart  found  that 
judgements  of  stimulus  location  (abova,  front  or  behind)  did  not,  as 
expected,  follow  the  predictions  of  the  precedence  effect  when  the 
secondary  source  followed  the  primary  sound  by  less  than  500  Msec. 
Blauert  interpreted  these  results  ss  confirming  the  relevance  of  the 
directional  band  theory  of  localisation.  Measurement  of  the 
frequency  spectrum,  with  various  dslay  Intervals,  revealed  that 
linear  distortions  in  the  spectrus  were  consistent  with  the 
predictions  of  a  combfilter  transformation. 


9.  Blauart,  J.  (1981).  Lateralization  of  Jtttared  tones.  The  Journal  of 
the  Acoustical  Society  of  America.  70.  694-698. 

This  study  reviewed  the  hypothesis  that  auditory  lateralization  was 
based  on  lnteraural  time  differences  (ITDs)  for  complex  high 
frequency  signals.  It  was  proposed  that  lateralization  could  be 
accomplished,  even  when  the  signal  does  not  contain  any  components 
below  1600  Hz  and  the  signal  envelopes  are  flat,  via  a  peripheral 
auditory  filter  which  converts  PM  to  AM  signals.  The  output  of  this 
transformation  would  produce  time-varying  envelopes  in  previously 
flat  signals.  This  hypothesis  was  evaluated  using  Jittered  tones. 
In  contrast  to  Nordmark  (1976),  who  found  ITDs  as  small  as  2 
microseconds  in  jittered  tones,  these  results  (173  microseconds)  did 
not  differ  from  previous  ITD  thresholds  found  with  non-Jittered  PM 
tones.  It  was  concluded  that  the  results  did  not  contradict  the  PM 
to  AM  transformation  hypothesis,  but  tltat  no  new  auditory  mechanism 
was  necessary  to  explain  localization  with  jittered  versus  non- 
Jlttered  tones. 


10.  Blauert,  J.  (1982).  Binaural  localization.  In  0.  J.  Pedersen  and  T. 

Poulsan  (Kds.),  Binaural  Effects  In  Normal  and  Impaired  Hearing 

(Scandinavian  Audioiogical  Supplement  15,  pp.  7-20). 


Thin  review  evaluated  the  relationship  between  physical 
characteristics  of  sound,  as  transformed  by  the  external  ear,  and 
the  various  psychophysical  aspects  of  auditory  localization  cues. 
The  purpose  was  to  develop  a  conceptual  model  of  signal  processing 
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Consistent  with  Che  above  physical-psychophysical  relationship. 
Blauert  concluded  chat  Che  use  of  dichotic  listening  casks  was 
Inadequate  In  elucidating  binaural  Interactions  in  sound 
localisation*  It  mbs  further  suggested  that  mors  use  be  made  of 
manikins  with  external  ears  In  the  Identification  o.1  binaural  cues. 

11*  Burkherd,  M.  0. ,  &  Sachs,  R.  H.  (1979).  Anthropometric  manikin  for 

acoustic  research.  Journal  of  the  Acoustical  Society  of  America. 
5£,  214-222. 

The  development  of  the  KEMAR  anthropometric ally  designed  manikin  was 
reviewed.  Anthropometric  data  and  KEMAR  physical  specifications 
(head  else,  orientation,  and  pinna  dimensions)  were  emphasised.  The 
acoustical  characteristics  of  the  KEMAR  esrdrum  simulator  are  also 
identified  and  validated  against  human  auditory  data.  The  KEMAR 
manikin  adequately  simulates  acoustic  diffraction  and  responses  in 
tbs  50th  percentile  adult  population. 

12.  Butler,  R.  A.  (1969a).  Monaural  and  binaural  localisation  of  noise 

bursts  vertically  in  the  median  sagittal  plane.  The  Journal  of 

Auditory  Research.  3.  230-233.  - 

Localisation  in  the  vortical  plane,  due  to  the  lack  of  lnteraural 
phase  or  intensity  differences!  is  often  viewed  as  being  difficult 
and  inaccurate.  However,  if  high  frequency,  complex  sound  sources 
are  utilised,  performance  can  be  quite  accurate.  Butler 
investigated  the  role  of  possible  binaural  cues,  based  upon  pinna 
transformations,  using  stimuli  which  varied  in  frequency 
composition.  In  all  stimulus  conditions  binaural  performance  was 
more  accurate  than  sionaural  performance.  In  addition,  broad  band 
noise  and  high  pass  noise  produced  better  performance  than  low  band 
pass  noise.  The  results  support  the  possible  use  of  pinna  transform 
cues  in  the  vertical  plane,  and  tha  necessity  of  high  frequency, 
complex  tones  to  make  use  of  these  cues. 

13.  Butler,  R.  A.  (1969b).  On  the  relative  usefulness  of  monaural  and 

binaural  cues  in  locating  sound  In  space.  Psychonomic  Science.  1 7, 
243-246. 

It  was  hypothesised  that  since  binaural  cues  are  available  in  the 
horlsontal  plane,  but  not  the  vertical  plane,  performance 
difference#  would  reflect  the  increased  number  of  cuee  available  In 
a  binaural  task.  In  comparing  horlsontal  to  vartlcal  localization 
performance,  it  was  concluded  that  the  binaural  cues  in  the 
horlsontal  plane  were  the  causal  factor  which  improved  localization 
accuracy  above  that  found  in  the  vertical  plane. 

14.  Butler,  R.  A.  (1970).  The  effect  of  hearing  Impairment  on  locating 

sound  in  the  vertical  plane.  International  Audiology,  117-126. 

This  article  examined  localization  accuracy  in  the  vertlcaL  plane, 
using  three  groups  which  differed  as  to  their  hearing 
deficiencies.  Subjects  with  a  bilateral  hearing  Impairment  for 
frequencies  in  the  region  of  8000  Hz  wete  compared  to  subjects  with 
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a  unilateral  hearing  impairment  and  a  normal  hearing  group.  The 
bilateral  hearing  impaired  group  was  completely  unable  to  localise 
in  the  vertical  plane,  but  was  proficient  in  the  horizontal  plane. 
The  unilateral  Impairment  group  performed  significantly  better,  but 
not  as  well  as  tha  normal  htering  group.  The  results  support  the 
notion  that  the  availability  of  high  frequency  information  1b 
important  in  localizing  in  tha  vartical,  but  not  tha  horizontal, 

plane.  The  difference  between  the  monaural  impairment  and  normal 
groups  suggested  that  binaural  cues  were  ueed  in  vertical  plane 
localization. 

15.  Butler,  R.  A.,  &  Belendiuk,  K.  (1977).  Spectral  cues  utilized  in  the 

localization  of  sound  In  tha  median  sagittal  plana.  Tha  Journal  of 
the  Acoustical  Sociaty  of  Amarlca.  61 .  1264-1269. 

Three  experiments  investigated  the  role  of  spectral  cues  in  the 
localization  of  sound  in  tha  vartical  plane.  Nolea  bursts  were 
recorded  via  microphona  implants  in  the  external  aar  under  free 

field  conditions.  Localization  accuracy  whan  tha  stimulus  was 
presentad  through  haadphonaa  was  identical  to  frae  flald  conditions. 
Spactral  analyaia,  aummad  over  subjects,  ravealad  a  notch  in  the 
frequency  response  which  was  dependent  upon  frequency.  Comparisons 
among  subjects  showed  that  tha  frequency  spectra  recorded  in  the 

ears  of  highly  accurate  subjects  consisted  of  e  regular  notch  which 

decreased  with  frequency,  while  the  poor  localizer's  notch  was  not 
as  orderly.  This  notch  in  the  sound  spectrun  Is  apparently  related 
to  vertical  plane  localisation  accuracy. 

16.  Butler,  R.  A.,  &  Flannery,  R.  (1980).  The  spatial  attributes  of 

stimulus  frequency  and  their  role  in  monaural  localization  of  sound 
In  the  horizontal  plane.  Perception  and  Paychophyaice.  28.  449-457, 

The  effects  of  frequency  composition  in  monaural  localization  were 
investigated.  It  was  found  that  as  the  center  frequency  of  s  1  kHz 
vide  band  noise  Increased,  within  limits,  the  apparent  location  of 
the  sound  moved  from  0  to  270  dagrees  azimuth.  Howavar,  at  certain 
critical  frequencies  the  apparent  location  would  return  to  0a  and 
then  progress  again  towards  270®  as  the  center  frequency  increased. 
This  pattern  of  responses  was  Interpreted  as  separate  spatial 
referents,  which  were  dependant  upon  frequency  composition. 

Individual  differences  In  terms  of  the  ranges  of  the  spatial 
reference  maps  (SRMs)  were  evident.  Bach  subject  displayed  either 
two  or  three  separate  SRMs.  SRM  1  tended  to  range  between  4  and  8 
kHf,  SRM  2  ranged  from  8  to  12  kHz,  and  SRM  3  ranged  from  13  kHz  and 
above.  In  a  second  experiment  the  location  of  the  center  frequency 
was  varied  such  that  for  each  subject  the  frequency  range  fell 

either  within  a  single  SRM  or  between  SRMs.  Accuracy  was  better  In 
the  between-SRM  condition. 

1 7 .  ButLer,  R.  A.,  &  He.lwlg,  B.  A.  (1983).  The  spatial  attributes  of 

stimulus  frequency  In  the  median  sagittal  plane  and  their  role  In 
sound  localization.  American  Journal  of  Otolaryngology,  4_,  165-1  73. 

This  study  Investigated  the  role  of  spaLial  referents  In  the  median 
plane.  The  apparent  location  of  a  sound  source  was  dependent,  not 
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upon  its  actual  location,  but  rather  on  the  canter  frequencies  of 
1.0  kHz  wide  noise  bands.  As  the  center  frequency  Increased  from  4 
to  12  kHz,  the  apparent  location  of  the  sound  shifted  from  in  front 
to  the  back  of  the  subject*  At  13  kHz  the  aound  again  shifted  to 
the  front.  Differences  in  spatial  reference  maps  was  attributed  to 
differences  in  the  pinna  amplification  function, 

18.  Chason,  L*  R. ,  McFarland,  T.  P. ,  &  Aldrich,  T.  B.  (1971).  Auditory 

effects  on  spatial  orientation  (Report  No.  71-10).  Colorado 
Springs,  CO:  United  States  Air  Force  Academy.  (AD  737331) 

i 

The  purpose  of  this  research  was  to  investigate  the  use  of  beat 
changes  as  a  cue  to  maintain  spatial  orientation.  The  task  required 
subjects  to  adjust  the  position  of  a  rod  via  a  joystick  until  the 
rod  was  parallel  with  a  reference  frame.  Visual  cues  present  or 
absent  and  auditory  beat  cues  present  or  absent  were  fully  crossed 
between  subject  variables.  The  dependent  variable  was  absolute 
aegis  error.  The  results  indicated  that  the  presence  of  both 
auditory  and  visual  cues  produced  the  moat  accurate  performance. 
The  use  of  visual  cues  alone  was  only  marginally  more  accurate  than 
auditory  cues  alone. 

19.  Coleman,  P.  D.  (1962).  Failure  to  localise  the  source  distance  of  an 

unfamiliar  sound.  The  Journal  of  the  Acoustical  Society  of  America, 
34  ,  343-346. 

The  role  of  experience  In  localising  the  distance  of  a  sound  source 
was  investigated.  The  results  indicated  that,  on  the  first  trial, 
distance  estimates  were  unrelated  to  the  actual  source  distance. 
However,  with  repeated  exposures,  the  ability  to  accurately  judge 
the  distance  of  the  source  increased.  It  was  concluded  that 
distance  estimation,  in  this  case,  was  based  upon  the  relative 
comparison  of  sounds  on  successive  trials,  rather  than  on  absolute 
distance  cues. 

20.  Coleman,  P.  D.  (1963).  An  analysis  of  cuss  to  auditory  dapth  parcaptlon 

in  fraa  apace.  Psychological  Bulletin.  60,  302-315. 

Cuea  for  auditory  distance  perception,  and  their  supporting 
evidence,  were  reviewed.  The  attenuation  of  tha  amplitude  of  a 
aound  wave  with  lncreaaaa  in  distance  was  confirmed  as  a  monaural 
cua  to  dlstanca  estimation  whan  familiar  sounds  were  employed.  The 
frequency  spectra  of  a  sound  source  was  also  implicated  as  a 
monaural  cue  to  distance  estimation.  At  far  distances  (>  4  feet) 
the  wave  front  is  approximately  planar  and  provides  distance  cues 
based  due  to  the  differential  attenuation  of  high  frequencies, 
relative  to  lower  frequencies.  This  exponential  loss  la  dependent 
upon  humidity,  temperature,  terrain  features,  and  inhomogeneltles  in 
the  atmosphere.  Binaural  cuss,  such  as  intensity  and  phase 
differences  between  tha  two  ears,  ware  also  investigated  as  possible 
distance  cute.  The  ratio  of  sound  pressure  at  the  two  ears  (l.e., 
the  binaural  Intensity  ratio,  or  BIR)  changes  with  distance  as  a 
function  of  azimuth.  The  change  in  distance  necessary  to  produce 
notlcaabla  changes  in  BIR  values  increases  as  either  azimuth  departs 
from  90  degrees,  frequency  Increases,  or  distance  Increases.  Vt  was 
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concluded  chat  BIRs  could  not  be  a  useful  cue  beyond  15  feet  even 
with  optimal  azimuth,  frequencies,  and  environmental  factors.  It 
was  further  concluded  that  distance  estimation  with  binaural  cues 
may  permit  absolute  judgement"}  of  distance,  while  monaural  cues  tend 
to  require  experience  with  the  sound  source  to  estimate  distance. 

21.  Coleman,  P.  D.  (1968).  Dual  role  of  frequency  spectrum  In  determination 

of  auditory  distance.  The  Journal  of  the  Acoustical  Society  of 
America.  44.  631-6.32.  —  — 

It  was  suggested  that  the  frequency  spectrum  of  a  s.  u.  '  may  play  a 
dual  role  in  auditory  depth  perception.  At  distances  great*.?"  than  a 
few  feet,  relatively  greater  high-frequency  content  may  signify  a 
closer  sound  source  due  to  the  greater  attenuation  of  high 
frequencies  relative  to  low  frequencies  by  passage  through  air.  nt 
these  distances  the  wavefront  may  be  considered  planer,  and  the 
differential  attenuation  follows  the  exponential  loss  law.  At 
closer  distances,  however,  the  sound  field  must  be  considered 
spherical,  and  it  is  the  low-frequency  content  of  the  spectrum  that 
is  more  prominently  affected  by  changes  in  distance.  As  distance  is 
increased  in  this  range,  the  low-frequency  content  is  reduced  more 
than  the  high-frequency  content.  Therefore,  given  a  stationary 
sound  source  in  the  near  field,  a  relative  Increase  in  the  high 
frequencies  may  cause  the  source  to  appear  more  distant,  whereas  in 
the  far  field  a  relative  increase  in  the  high  frequencies  may  cause 
the  source  to  appear  closer.  The  article  cites  experimental 
evidence  in  support  of  this  dual  role  for  the  frequency  spectrum  in 
perception  of  auditory  distance. 

22.  Dirks,  D.  D. ,  &  Gilman,  S.  (1979).  Exploring  azimuth  effects  with  an 

anthropometric  manikin.  Journal  of  the  Acoustical  Society  of 
America,  66,  696-701. 

The  KEMAR  manikin's  eardrum  response  to  both  pure  tones  and  pink 
noise  was  measured,  and  comparisons  were  made  with  Shaw's  (1974) 
data  on  the  human  eardrum  response.  The  KEMAR  manikin's  eardrum 
response  was  comparable  to  Shaw's  data,  except  for  higher 
frequencies  (  3  8,000  Hz),  and  where  the  test  ear  was  shadowed  by  the 
head  ( 3  -60  to  -120  degrees). 


23.  -Easton,  R.  D.  (1983). 

auditory  dominance. 


The  effect  of  head  movements  on  visual  r.nd 
Perception.  12.  63-70. 


Two  experiments  were  performed  to  evaluate  the  effects  of  visual 
discordance  (created  by  looking  through  a  prism)  on  a  location 
task.  In  each  experiment  the  subjects  located  a  small  audio  speaker 
unlmodally  or  blmodally.  Half  the  subjects  in  each  experiment  were 
unaware  they  were  viewing  through  a  prism;  the  other  subjects  were 
informed  that  a  prism  was  used  and  observed  a  demonstration  of  its 
refracting  properties.  The  second  experiment  differed  from  the 
first  in  that  head  movement  was  allowed.  The  results  of  the  first 
experiment  indicated  that  knowledge  of  the  visual  discordance 
greatly  reduced  the  bias  toward  the  visual  modality  in  blmodal 
trials;  that  Is,  the  subjects  tended  to  rely  on  the  auditory  cues 
more  if  they  were  aware  their  vision  was  distorted.  In  the  second 
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experiment,  head  movements  did  not  affect  the  precision  of  visual 
localization,  but  did  reduce  Che  visual  bias  in  blmodal  trials  (from 
the  levels  observed  in  the  first  experiment)  for  no-knowledge 
subj ects. 
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(1957).  localization  of  high-frequency  tones.  The  Journal  of  the 
Acoustical  Society  of  America.  29.  988-991. 

Hie  role  of  lnteraural  time  and  amplitude  differences  (ITD  and  IAD 
respectively)  cues  in  auditory  lateralization  wee  investigated. 
Physical  measurements  of  ITD  and  IAD  over  various  frequencies  and 
azimuthal  positions  were  made.  The  subjects  were  presented  with  a 
noise  stimulus  of  preset  ITD,  and  a  pure  tone  whose  IAD  could  be 
adjusted*  The  task  was  to  match  the  apparent  location  of  the  noise 
and  pure  tone.  The  results  indicated  considerable  disparity  between 
the  matched  pure  tone  and  the  actual  physical  measurements  recorded 
from  the  corresponding  azimuthal  position.  The  dloparity  was  quite 
pronounced  at  low  frequencies  and  decreased  considerably  for  higher 
frequencies. 

25.  Firestone,  P.  A.  (1930).  The  phase  difference  and  amplitude  ratio  at 

the  ears  due  to  a  source  of  pure  tone.  The  Journal  of  the 
Acoustical  Society  of  America.  2_,  260-270. 

The  amplitude  and  phase  of  a  pure  tone  were  measured  at  the  two  ears 
of  a  manikin  for  various  azimuthal  and  distance  positions,  and  the 
amplitude  ratio  and  phase  difference  were  calculated.  In  all  cases, 
as  distance  decreased  the  amplitude  ratio  decreased  and  the  phase 
difference  increased.  Azimuthal  position  affects  were  dependent 
upon  distance  and  frequency.  In  general,  the  largest  phase 
difference  for  signals  below  1000  Hz  was  found  at  90  degrees,  and 
the  minimum  amplitude  ratio  was  greatest  at  "  90  degrees. 

26.  Flannery,  R. ,  &  Butler,  R.  A.  (1981).  Spectral  cues  provided  by  the 

pinna  for  monaural  localization  in  the  horizontal  plane.  Perception 
and  Psychophysics.  24.  438-444. 

The  apparent  location  of  1  kHz  wide  band  noise  burst  was  found  to  he 
dependant  on  the  center  frequency  of  the  bend.  As  frequency 
Increased  (4  to  9  kHz)  the  apparent  source  location  migrated  from  0 
to  90  degrees.  At  the  higher  frequencies  the  source  appeared,  for 
some  subjects,  to  revert  back  to  0°  and  proceed  again  towards  90°. 
These  patterns  of  apparent  location  were  labeled  spatial  reference 
maps  (SRMa).  Measurement  of  pinna  amplification  showed  a  positive 
relationship  between  the  apparent  location  and  the  amplification 
function. 


27.  Forbes,  T.  W.  (1946).  Auditory  signals  for  instrument  flying.  Journal 
of  the  Aeronautical  Sciences.  May,  255-258. 

In  an  aircraft  simulator,  with  both  naive  and  experienced  pilots, 
auditory  signals  were  successfully  combined  to  supplement  visual 
information.  The  three-in-one  signal  indicated  heading  variations 
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by  a  directional  sweep  of  Che  sound,  bank  Information  by  pitch,  end 
airspeed  by  rate  of  occurance.  The  results  Indicated  Chat  the 
thre#-in-one  signal  can  be  successfully  used,  with  practice,  to 
maintain  simulated  flight.  An  automatic  system  for  varbally 
announcing  various  instrument  readings  was  described.  Five 
principles  for  the  successful  implementation  of  auditory  signals  in 
the  cockpit  were  presented  and  dlscussad. 

28.  Freedman,  S.  J. ,  &  Fisher,  H*  G.  (1968).  The  role  of  the  pinna  in 

auditory  localization.  In  S.  J.  Freedman  (Ed.).  The  Neurophysiology 
of  Spatially  Oriented  Behavior.  (pp.  135-152).  Homewood,  IL: 
Dorsey  Press. 

A  series  of  experiments  was  undertaken  to  examine  the  role  of  the 
pinna  in  auditory  localisation.  Subjects  restrained  from  head 
movements  were  able  to  accurately  localise  only  if  their  own  pinnae 
or  artlflcal  pinnae  were  present.  Subjects  with  one  ear  occluded 
were  able  to  localize  accurately.  Even  when  the  interaural  time 
difference  cues  were  confounded  by  lengthening  the  Interaural  axis, 
subjects  could  still  localise  as  long  as  artlflcal  pinnae  were 
used.  These  findings  suggest  the  pinna  playa  a  major  tola  in 
auditory  localization  and  cast  doubt  on  the  slgnflcance  of 
localization  experiments  which  eliminats  the  role  of  the  pinna 
through  the  use  of  earphones. 

29.  Freedman,  S.  J. ,  &  Pfaff,  D.  W.  (1962).  The  affect  of  dlchotlc  noise  on 

auditory  localization.  The  Journal  of  Auditory  Research.  2,  305- 
310. 

This  study  investigated  the  effects  of  noise  and  motility  on  a 
dlchotlc  time  discrimination  task.  The  time  discrimination  task 
consisted  of  obtaining  difference  thresholds  both  before  and  after 
exposure  to  noise  and  motility  conditions.  Three  motility 
conditions  were  employed  in  a  within-subjects  design.  These  were 
(1)  ambulatory  -  the  subject  walkad  through  a  busy  corridor  in  5 
minute  periods  while  listening  to  dlchotlc  noise  stimuli,  (2) 
recunbent  -  the  subject  layed  in  bed  without  gross  body  movements, 
and  (3)  passive  movement  -  the  subject  was  wheeled  through  the  busy 
corridor  as  in  the  ambulatory  condition.  It  was  predicted,  based 
upon  visual  displacement  studies,  that  performance  in  dlchotlc  time 
discrimination  would  decrease  as  noise  exposure  and  active  movement 
increased.  These  results  were  supported. 

30.  Freldes,  D.  (1974).  Human  Information  processing  and  sensory 

modality:  Cross-modal  functions.  Information  complexity,  memory  and 
deficit.  Psychological  Bulletin.  81.  284-310, 

The  theoretical  and  empirical  ralatlonahips  between  seneory  modality 
and  information  processing  were  reviewed.  It  was  suggested  that 
performance  differences  between  modalities  are  dependent  upon  the 
informational  complexity  of  the  stimulus.  Performance  tends  to  be 
equal,  regardless  of  modality,  if  the  stimuli  ore  relatively  simple. 
More  complex  stimuli  tend  to  show  a  performance  advantage  for  a 
particular  modality. 
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31.  Gardner,  M.  B.  (1969a).  Distance  eotlmation  of  0°  or  apparent  0°- 

orlented  speech  signals  In  anecholc  space.  The  Journal  of  the 
Acoustical  Society  of  America.  45  .  47-53. 

Tha  ability  of  observers  to  estimate  the  distance  of  speech  signals 
was  Investigated  In  a  series  of  experiments.  The  apparent  distance 
of  the  source  was  varied  by  manipulating  either  the  Intensity  of  the 
ssurca  or  the  actual  distance  from  the  source  to  the  listener.  Both 
recorded  and  live  voices  were  used  as  signals.  Using  live  voices 
which  varied  in  Intensity  (i.e.,  whisper,  quiet  speech, 

conversational  speech,  or  shouting),  observers  were  quite  accurate 
at  judging  the  distance  of  the  sound,  although  whispered  speech 
tended  to  be  underestimated  and  shouted  speech  tended  to  produce 
overestimation.  Recorded  voices,  In  all  conditions,  tended  to 

produce  poor  distance  estimates. 

32*  Gardner,  H.  B.  (1969b).  Imsge  fusion,  broadening,  and  displacament  in 

sound  localisation.  Tha  Journal  of  the  Acoustical  Soclaty  of 

America.  46,  339-349. 

Tha  various  methodologies  for  producing  "phantom"  sound  locations  In 
stereophonic  (two  speaker)  systems  are  reviewed.  Techniques  for 

producing  stereophonic  effects  from  monophonic  signals  are 
discussed. 

33.  Gardner,  M.  B.  (1973).  Some  monaural  and  binaural  facets  of  median 

plane  localisation.  Tha  Journal  of  tha  Acoustical  Society  of 

America.  54 ,  1409-1495. 

This  article  examined  the  localization  of  noist-band  signals  In  the 
anterior  sector  of  the  median  plane.  Both  subjective  observations 
end  objective  measurements  (using  a  manikin  head)  were  obtained  In 
an  anacholc  chamber.  Optimum  localisation  accuracy  was  obtained  in 
binaural  conditions  with  the  csvltiss  of  both  pinnae  entirely 
unoccluded.  In  monaural  conditions  the  apparent  location  of  the 
sound  landed  to  shift  ewey  from  the  median  plane;  hence,  binaural 
receptUn  seema  to  be  important  in  maintaining  the  proper  azimuthal 
position  for  median  plans  sources.  Occlusion  of  the  cavities  in  the 
pinna  (laaving  the  ear  canal  open)  was  found  to  degrade 
localization,  suggesting  that  the  pinna  cavities  provide  Important 
monaural  cues.  Measurements  taken  at  the  manikin  sardrun  revealed 
that  amplitude  differencaa,  ae  a  function  of  elevation  angle,  are 
too  small  to  account  for  more  than  a  small  part  of  tha  localization 
accuracy  obtained  in  this  sector  of  the  median  plane.  Differences 
In  frequency  responee  charecteristlcs  were  found  to  be  more 
•ubetantlal,  and  thus  a  possible  source  of  cues. 

34,  Gardner,  M.  B. ,  4  Gardner,  R.  S.  (1973).  Problem  of  localization  In  the 

median  plane:  effect  of  pinnae  cavity  occlusion.  The  Journal  of 
the  Acoustical  Society  of  America,  53,  400-408. 

This  study  examined  the  effect  of  pinna  cavity  occlusion  on 
localization  in  tha  anterior  and  posterior  ractors  of  the  median 
plana.  Rasulta  indicated  that  (1)  localization  accuracy  decreases 
as  the  degree  of  pinna  cavity  occlusion  Increases,  (2)  accuracy  is 
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better  for  broad-band  noise  signals  than  narrowband  signals  for  all 
degrees  of  occlusion,  (3)  accuracy  increases  as  the  frequency  of  the 
noise  band  Increases,  and  (4)  greater  accuracy  is  possible  in  the 
anterior  than  the  posterior  sector  of  the  median  plane.  It  was 
concluded  that  differences  in  pinna  transforms,  as  a  function  of 
elevation  angle,  are  an  important  source  of  cues  for  localization  in 
the  median  plane. 
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factors  in  undersea  warfare  (pp.  201-217).  Washington,  DC: 
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This  review  discussed  the  feasibility  of  using  auditory  signals  to 
reduce  visual  workload  in  a  submarine  scenario.  The  various  types 
of  auditory  information  were  discussed,  and  basic  psychological 
attributes  employed  in  encoding  the  auditory  information  were 
emphasized.  The  advantages  and  disadvantages  of  auditory  signalling 
were  identified. 

36.  Gatehouse,  R.  W.  (1982).  Summary:  New  directions.  In  R.  W.  Gatehouse 

(Ed.),  Localization  of  Sound:  Theory  and  Application  (pp.  267- 

270).  Croton,  CT:  Amphora  Press. 

The  summary  remarks  identified  several  key  areas  of  research  needed 
In  auditory  localization.  Generally,  the  need  to  study  ecologically 
valid,  and  dynamic  (rather  than  static)  localisation  situations  was 
stressed.  In  part,  research  needs  in  distance  location,  moving 
sound  and  moving  listener  effects,  duration  of  stimulus  effects,  and 
masking  effects  were  identified. 

37.  Gopher,  D.  (1973).  Eye-movement  patterns  In  selective  listening  tasks 

of  focused  attention.  Perception  and  Psychophysics .  14 .  259-264. 

This  article  examined  the  role  of  eye  movements  in  dichotlc 

listening  casks.  During  the  presentation  of  auditory  messages  the 
spontaneous  occurrences  of  eye  movements  were  reduced,  and  the  eye 
movements  and  fixations  which  did  occur  tended  to  be  in  the 
direction  of  Che  ear  receiving  the  message.  When  the  dichotlc 

switching  cask  was  changed  to  a  monaural  presentation  mode,  the  aye 
movements  during  message  presentation  tended,  unlike  before  ,  to 
remain  centrally  located.  It  was  concluded  that  the  elimination  of 
dichotlc  competition  reduced  Che  orienting  response  towards  the 
signal.  In  an  effort  to  increase  listening  demands,  monaural 

presentation,  with  increased  message  frequency,  was  included.  The 
results  indicated  that  Increasing  the  difficulty  of  the  task  did  not 
Increase  the  eye  movement  orienting  response. 

38.  Gotoh,  T.  (1982).  Can  the  acoustic  head  related  transfer  function 

explain  every  phenomenon  in  sound  localization.  In  R.  W.  Gatehouse 
(Ed.),  Localization  of  Sound:  Theory  and  Application  (pp.  244- 
248).  Groton,  CT:  Amphora  Press. 


An  application  for  psychoacouscically  derived  head-related  transfer 
functions  to  Improve  stereophonic  reproduction  was  presented.  Both 
diotlc  and  dichotlc  transform  information  was  necessary  to  produce 
accurate  sound  localization  from  a  stereophonic  system. 
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39.  Grantham,  D.  W. ,  &  Wightman,  F.  L.  (1978).  Detectability  of  varying 

interaural  temporal  differences.  The  Journal  of  the  Acoustical 
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This  study  Investigated  the  effects  of  interaural  time  differences 
(lTDs)  in  detecting  intracranial  movement.  Two  parameters  of  the 
itimuli  were  varied.  These  were  the  modulation  frequency  (fm)  that 
is,  the  rate  of  movement,  and  the  peak  sinusoidal  difference  in  ITDs 
(AT)  that  is,  the  extent  of  movement.  The  results,  using  a  2  forced 
choice  procedure,  indicated  that  the  magnitude  of  At  necessary  for 
movement  detection,  increased  as  fm  increased.  The  encoding  of 
temporal  cues  was  characterized  as  a  relatively  slow,  Low-pass 
filter. 

40.  Greene,  0.  G.  (1968).  Comments  on  "Perception  of  the  range  of  a  sound 

iburc*  of  unknown  strength."  The  Joutrtal  of  the  Acoustical  Society 
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this  paper,  using  Hirsh's  (1968)  equation  to  determine  the  distance 
of  a  sound  source,  showed  that  the  relative  error,  under  optlmel 
conditions,  would  produce  4 SX  uncertainty  in  range  estimate.  It  was 
concluded  that  Hirsh's  equations  Were  unreliable  as  a  cue  for 
distance  localization. 

41.  Hebrank,  J. ,  &  Wright,  D.  (1974a).  Are  two  ears  necessary  for 

localization  of  sound  sources  on  the  median  plane?  The  Journal  of 
the  Acoustical  Society  of  America,  56 .  935-938. 

Localization  in  the  median  plane  has  often  been  attributed  to  pinna 
transformations  of  the  incoming  source,  due  to  the  lack  of 
interaural  intensity  and  time  differences.  However,  binaural 
localization  has  been  found  to  bo  superior  to  monaural  localization, 
indicating  that  the  binaural  comparison  of  the  wavefronts  at  each 
ear  is  a  necessary  cue  for  localization.  experiment  1  compared 
binaural  and  monaural  performance  with  either  familiar  or  unfamiliar 
sounds.  Binaural  performance,  as  expected,  was  superior  to 
monaural,  and  familiar  sounds  were  more  accurately  localized  than 
unfamiliar  sounds.  Experiment  2  evaluated  the  change  in  performance 
in  monaural  localization  with  training  and  feedback.  The  training 
resulted  in  monaural  localization  accuracy  improving  to  previously 
measured  binaural  performance.  The  necessity  of  two  ears  for  median 
plane  localization  was  rejected. 
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the  Acoustical  Society  of  America.  56.  1829-1834. 

Three  experiments  investigated  the  role  of  frequency  spectra  on 
sound  localization  in  the  median  plane.  The  experiments  Identified 
the  features  encoded  .by  the  external  ears.  The  results,  as 
expected,  differed  according  to  the  frequency  spectra  presented. 
Frontal  localizations  were  perceived  with  one  octave  notch  whose 
lower  cutoff  frequency  was  between  4  and  8  kHz.  Localizations  above 
the  listener  occurred  for  sound  sources  of  V4  octave  bands  which 
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peaked  between  7  and  9  kHz.  Locations  behind  the  listener  were 
perceived  with  frequency  bands  which  peaked  between  10  and  12  kHz. 
In  general,  che  cues  for  median  plane  localization  reside  between  4 
and  12  kHz;  however  the  specific  relationship  between  notch 
frequency  and  perceived  auditory  elevation  wsb  not  clear. 
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Reaction  times  to  sounds,  lights,  and  sound-light  combinations,  were 
measured.  In  the  light  and  sound  combination  condition  the  light 
always  preceded  the  sound,  and  the  degree  of  onset  asynchrony  was 
varied.  The  results  indicated  that  the  light  and  sound  combination 
facilitated  reaction  time,  and  that  the  greatest  facilitation 
occurad  whan  the  asynchrony  was  aqual  to  the  difference  in  reaction 
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It  was  mathematically  demonstrated  that  the  distance  of  a  sound, 
regardless  of  its  initial  amplitude,  could  be  calculated  by 
intaraural  time  and  amplitude  differences  between  the  two  ears.  The 
relationship  between  interaural  differences  and  the  distance  of  a 
source  was  not  supported  by  any  psychophysical  data.  Therefore,  the 
utility  of  this  calculation  cue  is  unknown.  A  hypothetical 
psychophysical  experiment  was  proposed  to  explore  this  possibility. 
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Noise  bursts,  which  controlled  for  loudness  cues,  were  presented  at 
various  distances  from  an  observer.  Subjects  whose  right  asr  faced 
cowards  the  sound  source  were  able  to  accurately  rank-order  the 
distance  of  the  stimuli.  However,  sublects  who  directly  faced  the 
sound  source  were  unable  to  discriminate  between  distances.  Thaee 
results  have  implications  for  lnt.raural  time  and  amplitude 
difference  cues. 

46.  Jones,  B. ,  &  Kabanoff,  B.  (1975).  Eye  movements  in  auditory  apace 

perception.  Perception  &  Psychophysics.  1  7.  241-24  5. 

The  effects  of  eye  movement!  and  their  correspondence  to  auditory 
information  were  investigated  using  a  signal  detection  paradigm. 
Experiment  1  evaluated  the  effects  of  localization  with  eye 
movements  fixed  or  free,  and  found  that  eye  movements  improved 
localization.  Experiment  2  introduced  congruent  or  discrepant 
visual  information  to  assess  the  effect  of  eye  movements.  The 
results  indicated  a  significant  difference  between  congruent  and 
dlscrepent  information,  but  Che  effect  of  eye  movements  was  not 
significant.  Therefore,  it  was  suggested  that  it  is  not  eye 
movements,  per  se,  that  affect  localization  performance,  but  rather 
the  congruence  between  eye  movements  and  location.  Experiment  3 

L  95 


kW  .  a  .  k  -  aVbtu  •**  \.yxy 


substituted  visual  information  with  verbal  instructions  to  move 
their  eyes  one  direction  or  the  other,  in  order  to  assess  the 
effects  of  eye  movements  in  the  absence  of  visual  cues.  The  results 
again  support  the  notion  that  eye  movements  affect  the  localization 
of  auditory  signals  even  in  the  absence  of  visual  information. 

47.  Kock,'  W.  E.  (1950).  Binaural  localization  and  masking.  The  Journal  of 

the  Acoustical  Society  of  America,  22 .  801-804.  ~ 

This  study  investigated  the  role  of  interaural  time  differences 
(ITDs)  in  the  discrimination  of  blnaurally  praaentad  speech  signals 
from  noise.  In  general,  a  reduction  in  ITD  increased  the  difference 
threshold  for  identifying  speech.  It  was  concluded  that  differences 
in  arrival  cues  allow  for  sound  localization,  and  improve  diacrimin- 
ability  of  signals  from  noise. 

48.  Kuhn,  G.  P.  (1977).  Model  for  the  interaural  elms  differences  in  the 

esimuthal  plane.  The  Journal  of  the  Acoustical  Society  of  America. 
62,  157-167.  ~ 

Interaural  time  and'  amplitude  differences  (ITD  and  IAD, 
respectively)  were  measured  in  a  KEMAR  manikin  at  various  azimuthal 
positions.  Comparisons  between  measured  and  theoratical  ITD  valuaa 
were  made,  and  vara  concludad  to  ba  raasonably  similar.  The  data 
support  that  ITD  is  Independent  of  frequency  both  below  500  Hz  and 
above  3000  Hz.  ITDs  were  minimal  batwman  1400-1600  He  for  azimuthal 
positions  between  15  and  60  degrees.  These  results  support  past 
findings,  which  concluded  that  localization  was  poor  between  1000- 
2000  Hz,  and  that  tha  localization  cue  of  IAD  waa  used  above  1400 
Hz. 


49.  Kuhn,  G.  F.  (1982).  Towards  a  modal  for  sound  localization.  In  R.  W. 

Gatehouse  (Ed.),  Localization  of  Sound;  Theory  and  Applications 
(pp.  31-64).  Groton,  CT:  Amphora  Preas. 

A  mathematical  modal  of  sound  localization  cues,  based  on  interaural 
time  differences  (ITD)  for  low  frequency  Information,  and  interaural 
amplitude  differences  (IAP)  for  high  frequency  information,  was 
presented.  The  model  was  supported  by  changes  in  ITD  and  IAD  for 
various  hearing  aid  placements. 

50.  Lambert,  R.  M.  (1974),  Dynamic  theory  of  sound-source  localization. 

The  Journal  of  tha  Acoustical  Society  of  America.  56.  165-171. 

The  effects  of  heed  movement  on  the  localization  of  sound  source 

azimuth  and  range  was  mathematically  modeled.  In  binaural 

situations,  interaural  time  differences  (ITD)  will  systemat IcaL Ly 
change  as  head  position  is  changed.  The  difference  between  ITDs  on 
two  successive  samples  of  the  etlmulua,  and  knowledge  of  the  degree 
of  head  movement  from  sample  1  to  sample  2,  will  generate 

mathematical  solutions  for  determining  azimuth.  In  terms  of 
binaural  range  perception,  mathematical  solutions  to  distance  based 
upon  Interaural  amplitude  differences  (IAD)  sampled  between 
successive  head  movements  were, generated.  Attention  was  paid  to  the 
limitations  Imposed  by  human  interaural  time  and  amplitude 
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difference  thresholds,  and  the  parameters  (range  and  angle)  over 
which  Chase  cues  could  operate.  In  terms  of  theory  It  is  unclear  if 
observers  calculate  these  changes  or  use  learned  relationships  to 
determine  azimuth  and  range.  In  addition,  a  two  factor  theory  of 
localization  was  proposed,  in  which  low-frequency  Information  is 
utilized  if  head  movements  are  available,  and  high-frequency 
information  is  utilized  when  head  movements  are  constrained. 

51.  Levy,  E.  T. ,  &  Butler,  R.  A.  (1978).  Stimulus  factors  which  influence 

the  perceived  externalizatlon  of  sound  presented  through 

headphones.  The  Journal  of  Auditory  Research.  18.  41-50. 

In  two  experiments,  the  stimulus  factors  Involved  in  the 
externalizatlon  of  sound  presented  through  headphones  was 
examined.  Experiment  1  varied  Che  inceraural  time  differences 
(1TD),  interaural  spectral  differences  (ISD),  and  lnteraural 
amplitude  differences  (IA0)  available  In  the  stimulus.  Three 
stimulus  tapes  were  createdi  Tape  l  contained  ITD,  ISD,  and  IAD 
cues,  tape  2  contained  ITD  and  ISC  cues,  and  tape  3  contained  only 
ITD  cues.  In  each  condition  frequency  composition  of  the  noise 
bursts  ware  varied  from  .5  to  A  kHz .  The  results  indicated  a 
significant  effect  of  frequency  composition,  with  high  frequency 
sounds  being  judged  as  closer  to  the  head  than  low  frequency 
sounds.  No  effect  of  lnteraural  cue  content  was  found.  The  second 
experiment  decomposed  ITD  cues  into  lnteraural  arrival  differences 
(IARD)  and  lnteraural  ongoing  differences  (IOND).  One  tape 

contained  both  IARD  and  IOND  cues  (l.a.,  tape  3  from  experiment  1) 
and  the  second  eliminated  IARD  cues  from  tape  3.  The  results  showed 
a  significant  affect  of  frequency  as  in  experiment  1,  and  a 
significant  difference  between  frequency  composition  cues.  The  IOND 
cues  alone  condition  was  perceived  as  being  closer  to  the  head  than 
the  IOND  and  IARD  condition.  It  was  concluded  that  the  perception 
of  externalized  sound  can  occur  even  when  many  of  the  natural  cues 
are  eliminated. 

52.  Loudsbury,  B.  F. ,  &  Butler,  R.  A.  (1979).  Estimation  of  distances  of 

recorded  sounds  presented  through  headphones.  Scandinavian 
Audiology.  _8,  145-149. 

Two  experiments  investigated  the  role  of  frequency  composition, 
monaural  vs.  binaural  presentation,  and  azimuthal  position  in  an 
auditory  distance  estimation  task.  Subjects  received  noise  bursts 
(270  degrees  azimuth)  recorded  at  the  eardrun  of  other  individuals. 
The  stimuli  varied  as  to  the  amount  of  head  shadow  available  from 
distances  of  2  to  10  feat.  Stimuli  were  presented,  in  a  within 
subjects  design,  both  monaurally  and  blnaurally  for  1  kHz  low-pass 
noise  bursts,  unfiltered  noise  bursts,  and  4  kHz  high-pass  noise 
bursts.  Subjects  were  given  one  reference  tone  at  5  feet  as  a 
comparison  stimulus.  The  second  experiment  was  identical  to  the 
first,  except  that  azimuthal  position  was  varied  (360,  330,  300,  and 
270  degrees).  Nine  out  of  48  subjects  in  the  two  experiments  tended 
to  reverse  their  judgements  of  distance.  These  subjects,  termod 
Inverters,  were  analyzed  separately  from  the  non-inv8rters.  The 
results  showed  that  non-inverters  accurately  estimated  the  distance 
of  the  sound  source,  especially  at  higher  frequencies  and  at  330 
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degrees  azimuth.  Monaural  versus  binaural  presentation  had  llttLe 
affect  upon  performance.  It  was  concluded  that  the  Intensity  ratio 
of  direct  to  reflected  sound  can,  when  loudness  cues  are 
unavailable,  be  used  as  a  relative  cue  for  distance  estimation. 

53.  Loveless,  N.  3.,  Brebner,  J. ,  &  Hamilton,  P.  Bisensory  presentation  of 

information.  Psychological  Bulletin.  73,  161-199. 

The  literature  concerning  biaen'ory  presentation  of  information  was 
reviewed  and  interpreted  according  to  signal  detection  theory.  In 
general,  it  was  assumed  that  the  addition  of  redundant  information 
would,  due  to  probability  summation,  reduce  detection  thresholds. 
Pour  experiments,  using  a  signal  detection  paradigm,  were  conducted 
to  investigate  unlreodal  and  blmodal  performance  as  the  information 
redundancy  between  the  signals  Is  varied.  In  general,  the  results 
Indicated  that  redundancy  between  the  signals  la  Important;  however, 
the  effect  differs  widely  between  subjects. 

54.  Mastroiannl,  G.  R.  (1982).  The  Influence  of  eye  movements  and 

illumination  on  auditory  localisation.  Perception  and 

Paychophyelcs.  31,  581-584. 

This  etudy  critically  evaluated  Shelton  and  Searle'e  (1980) 
conclusions  on  visual  facilitation  of  auditory  localization.  An 
experiment  was  designed  to  evaluate  the  frame  of  reference 
hypothesis  and  memory  stabilization  hypothesis.  The  crossed 
Independent  variables  in  a  repeated  measures  design  were 
Illumination  (light  or  dark)  and  eye  movements  (permitted  or  not 
permitted).  An  ANOVA  on  mean  error  scores  indicated  that  only  eye 
moveraente  In  a  lighted  environment  Improved  performance.  In 
general,  the  results  tended  to  lend  more  support  to  the  frame  of 
reference  position, 

55.  Matin,  L.  (1982a).  Visual  and  auditory  localization:  Normal  and 

abnormal  relations.  In  G.  T.  Chiaum  &  P.  B.  Morway  (Eds.),  Research 
Program  Review:  Aircraft  Physiology  (Report  No.  NADC-822^2-60). 

Warminster,  PA:  Naval  Air  Development  Center. 

This  article  reviewed  several  studies  on  how  the  coordination  of 
bleeneory  localization.  is  accomplished  and  identified  possible 
mechanisms.  The  studies,  using  auditory/ visual  matching  tasks,  used 
three  populations  of  subjects:  strabismic,  bilateral  asymmetric 

hearing  deficits,  and  normal  subjects.  The  article  defined  visual 
capture,  vinual-fleld  suppression,  and  cancellation  mechanisms  for 
visual  localization.  Three  theoretical  cancellation  mechanisms  - 
inflow,  outflow,  and  hybrid  theory  -  were  identified. 

56.  Matin,  L. ,  Picoult,  E. ,  Stevens,  J.  K.  ,  Edwards,  M.  W. ,  Young,  D.  ,  6 

MacArthur,  R.  (1982).  Oculoparalytic  Illusion:  Visual-field 

dependent  spatial  mlslocalizations  by  humans  partially  paralyzed 
with  curare.  Science ,  216,  198-201. 

The  oculoparalytic  illusion  (which  occurs  under  curare-induced 
paralysis)  was  discussed.  The  illusion  occurs  only  in  darkness,  and 
can  be  described  as  a  misperception  of  che  location  of  a  fixated 
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light.  Restoration  of  illumination  restores  the  appearance  of  the 
light  to  its  actual  position,  and  apparently  no  amount  of  training 
influences  this  illusion.  Evaluation  of  the  cause  of  the  illusion 
centered  upon  whether  the  angle  of  the  eye  in  the  head  or  the  angle 
ot'  the  head  and  body  with  respect  to  gravity  were  involved. 
Systematic  variations  of  both  angles  concluded  that  the  former  was 
responsible.  Another  study  evaluted  the  effects  of  this  illusion  on 
auditory  localization.  In  normal  illumination,  paralyzed  observers 
were  accurate  in  positioning  a  visual  target  to  the  median  plane. 
However,  in  matching  a  sound  source  location  to  a  light,  large  gaze- 
dependent  errors  occurred. 

Matin,  L. ,  Stevens,  J.  K. ,  &  Picoult,  E.  (1983).  Perceptual 

consequences  of  experimental  excraocular  muscle  paralysis.  In  A. 
Hein  &  M.  Jeannerod  (Eds.),  Spatially  Oriented  Behavior.  New  York, 
Springer-Verlag. 

Complete  data  la  provided  on  the  causal  relationship  between  the 
oculoparalytlc  Illusion  and  the  angle  of  the  eye  in  the  head.  In 
terms  of  auditory-visual  matches,  curarized  subjects  In  full 
illumination  tended  to  make  localisation  match  errors  related  to 
gaze  position.  The  errors  were  attributed  to  a  suppression  of  the 
output  of  extraretlnal  eye  position  information. 

McFadden,  D. ,  &  Pasanen,  E.  G.  (1975).  Binaural  beats  at  high 

frequencies.  Science ,  190,  3S4-396. 

Evidence  is  presented  which  suggests  that  binaural  beats  (previously 
obtained  only  for  low-frequency  sinusoidal  input*)  can  be 
experienced  at  higher  frequencies  if  complex  waveforms  are  used. 
The  phenomenon  of  binaural  beats  Is  characterized  by  the  perception 
of  a  single  beat  created  by  the  presentation  of  two  separate  tones, 
which  differ  in  frequency,  to  different  earB.  In  some  ways  the 
binaural  beats  found  for  high-frequency  complex  waveforms  were 
slmiliar  to  beats  perceived  with  low-frequency  sinusoids.  However, 
there  were  several  ways  in  which  they  differed.  High  frequency 
beats  tended  to  be  fainter,  to  be  perceived  over  a  larger  range  of 
frequencies,  and  to  alter  pitch  perception.  It  was  concluded  that 
the  beating  phenomenon  is  probably  based  upon  the  same  mechanisms 
that  account  for  low-frequency  beats. 

McFadden,  D. ,  &  Pasanen,  E.  G.  (1976).  Lateralization  at  high 

frequencies  based  on  interaural  time  differences.  The  Journal  of 
the  Acoustical  Society  of  America,  59,  634-639. 

A  series  of  experiments  investigated  the  effects  of  high  frequency 
complex  waveforms  on  auditory  lateralization.  A  single-interval- 
forced-choice  paradigm  was  employed.  The  task  consisted  of 
determining  the  side  to  which  two  signals,  one  of  which  was  time 
delayed  to  either  the  right  or  left  ear,  were  lateralized.  In 
general,  the  results  support  the  hypothesis  that  the  auditory 
system,  contrary  to  the  classical  duplex  theory  ot  localization,  is 
sensitive  to  ongoing  interaural  time  differences  (ITDs)  at  high 
frequencies.  Several  parameters  of  the  wavefront  were 

investigated.  These  were  center  frequency,  use  of  two-tone  complex 


signals,  depth  of  modulation,  and  simultaneous  presentation  of 
complex  wavefronts  which  differed  in  center  frequency.  It  was 
concluded  that  the  auditory  system  is  sensitive  to  ITD  at  high 
frequencies  when  complex  sound  sources  are  employed,  and  that  the 
analysis  of  ITD  effects  must  differentiate  between  arrival 
differences,  ongoing  time  differences,  and  envelope  time 
differences. 

60.  McNulty,  J.  A.  (1982).  Underwater  sound  and  human  hearing.  In  R.  W. 

Gatehouse  (Ed.),  Localization  of  Sound:  Theory  and  Applications 
(pp.  250-266).  Groton,  CT:  Amphora  Press. 

The  ability  of  humans  to  localize  sounds  underwater  was  assumed  to 
be  relatively  poor.  It  was  felt  that  the  change  in  medium  would 
distort  the  classical  localization  cues  such  aa  head  shadow  and 
lntaraural  time  and  intensity  differences.  In  a  series  of  seven 
experiments,  the  ability  to  localize  sounds  and  use  echo  Information 
to  determine  the  distance  of  a  sound  source,  while  underwater,  was 
shown.  Accuracy  was  increased  through  training,  practice,  and  under 
anachoic  conditions  (i.e.  open  ocean). 

61.  Mshrgardt,  S.,  &  Mellert,  V.  (1977).  Transformation  characteristics  of 

the  external  human  ear.  The  Journal  of  the  Acoustical  Society  of 
America.  61.  1567-1576. 

Transfer  function  (amplitude  and  phase)  measurements  were  taken  over 
a  frequency  range  of  200  to  15000  Hz  for  both  the  horizontal  and 
median  planes.  Measurements  were  made  2.  mm  inside  the  ear  canal. 
Tha  results  agreed  well  with  previous  studies.  At  azimuths  between 
0  and  -54°  (away  from  tha  ear)  an  amplitude  dip  is  present  at  1  kHz, 
due  to  head  diffraction.  For  positive  azimuthal  values,  the 
amplitude  increased  relative  to  the  free  field.  In  the  vertical 
plane,  the  only  Increase  in  amplitude  is  at  8  kHz,  at  90s.  This 
corresponds  to  the  finding  that  noise  bands  in  this  region  are 
perceived  aa  from  above.  These  results  have  implications  for  the 
transformations  (phase  and  amplitude)  the  pinna  performs  on  incoming 
signals. 

62.  Metcalfe,  J.  ,  Glavanov,  D. ,  &  Murdock,  M.  (1981).  Spatial  and  temporal 

processing  in  the  auditory  and  visual  modalities.  Memory  & 
Cognition.  9_,  351-354. 

Three  studies  evaluated  tha  effects  of  input  modality  and  type  of 
recall.  Visual  and  auditory  modalities,  and  spatial  and  temporal 
recall  were  fully  crossed  variables  in  a  repeated  measures  design. 
In  all  three  experiments  spatial  recall  was  superior  to  temporal 
recall  in  the  visual  modality,  and  temporal  recall  was  superior  to 
spatial  recall  In  the  visual  modality.  It  was  concluded  that  each 
modality  is  sensitive  to  different  aspects  of  the  stimulus  array. 

63.  Mills,  A.  W.  (1958).  On  the  minimum  audible  angle.  The  Journal  of  the 

Acouatlcal  Society  of  America,  30 ,  237-246. 

This  study  measured  the  difference  threshold  for  locating  the 
direction  of  a  source.  Using  tonal  stimuli,  ranging  from  250  to 
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10,000  Hz,  it  was  found  that  the  minimum  audible  angle  (MAA)  was 
dependent  upon  both  frequency  and  Azimuthal  position  (measured 
between  0  and  90  degrees).  The  MAA  is  approximately  1  degree  for  a 
source  at  0°  azimuth  and  between  500-750  Hz.  Increases  in  azimuth 

Increased  the  MAA.  The  effects  of  frequency  were  dependent  upon 

azimuth,  but  in  general  tended  to  be  cyclic  in  nature.  In  the 

median  plane  the  effects  of  frequency  were  airalliar  to  that  found  in 
the  horizontal  plane.  Interaural  phase/time  differences  (ITD),  for 
low  frequencies,  and  interaural  amplitude  differences  (IAD),  for 
higher  frequencies,  were  obtained  and  compered  at  varloua  MAAa  at 
different  azimuthal  positions.  The  results  indicsted  that  the  ITD 
threshold,  under  optimal  MAA  conditions,  was  approximately  10 
Usee.  The  IAD  threshold,  again  under  optimal  MAA  conditions, 
averaged  .5  db. 

Mills,  A.  W.  (1963).  Auditory  perception  of  spatial  relatione.  In 
Proceedings  of  the  International  Congress  on  Technology  and 

Blindnaam,  Vol.  2  (!pp.  111-139).  New  York}  American  Foundation 

for  the  Blind. 

This  review  Identified  varloua  factors  in  auditory  localisation,  end 
discussed  neuronal  models  to  account  for  them.  A  comprehensive 
review  of  the  work  of  Fsddersen,  Sandel,  Tees,  end  Jeffrese  (1957), 
and  Sandal,  Teas,  Fsddersen  end  Jeffress  (1955)  on  the  localisation 
of  pure  tones  revealed  that  the  localization  of  low  frequency  tones 
(<  1500  Hz)  was  accomplished  primarily  through  interaural  time 
differences  (ITDs).  The  poor  localisation  of  tones  around  1500  Hz 
was  attributed  to  phaee  ambiguities,  the  gradual  breakdown  of 
neuronal  synchrony  above  1000-1500  Hz,  end  the  lack  of  ITD  cues 
based  on  the  waveform  envelope  for  the  sound  waves  used  in  the 
studies.  This  review  also  covered  the  work  of  Wallach  (1940)  on 
heed  movement  effects.  Echo  localization  end  the  precedence  effect 
was  also  revlewad.  Dallanbach's  (1953)  work  on  echo  reflections  was 
reviewed,  as  wore  the  natural  echoes  produced  by  the  pinna  (Betteau, 
1963,  1965).  It  was  hypothesized  that  these  pinna  reflections  could 
be  peripherally  encoded  on  the  baelllar  membrane  if  it  was  assumed 
chat  some,  but  not  all,  of  the  neurons  fired  each  time  a  deflection 
was  made  on  the  baslllar  membrane.  The  two  sets  of  time  varying 
neurons  would  then  combine  information  into  an  autocorrelation 
matrix.  The  autocorrelation  function  would  encode  Che  pattern  of 
excitation,  similier  to  models  of  pitch  perception  encoding,  and 
deduce  the  source  location. 

Moltno,  (  1973).  Percalving  the  range  of  a  sound  when  direction  is 
known.  The  Journal  of  the  Acoustical  Society  cf  America.  53.  1301- 
1304. 

Iltrsh's  (1968)  derivation  of  the  role  of  Interaural  time  and 
amplitude  differences  In  the  perception  of  distance  was  questioned. 
In  essence,  Che  correlation  between  Interaural  time  and  amplitude 
differences  (ITD  and  IAD,  respectively)  results  in  the  distance 
estimate  being  contingent  upon  both  rhe  source  amplitude  and 
azimuth.  In  order  to  use  Hirsh's  equation  relating  IAD  and  ITD,  one 
of  the  above  parameters  must  be  Independently  available  to  the 
observer.  Hirsh's  model  was  tested  by  using  an  array  of  speakers  at 


various  distances  (3  to  48  feet)  which  were  calibrated  to  control 
for  the  1/R  Loss  Law.  The  results  indicated,  contrary  to  Holt  and 
Thurlow' s  (1969)  data,  that  observers  with  their  right  ear  towards 
the  speaker  were  unable  to  make  reliable  judgements  baaed  on  ITD  and 
ZAD  Information.  The  discrepancy  between  experiments  was  attributed 
to  procedural  and  qieasurement  differences. 

66.  Morimoto,  M. ,  &  Ando,  Y.  (1982).  On  the  slmuletion  of  sound 

localisation.  In  R.  W.  Gatehouse  (Bd.),  Localization  of  Sound: 
Theory  and  Application  (pp.  85-98).  Groton,  CT:  Amphora  Press. 

The  simulation  of  sound  localization  using  measured  head  related 
transfer  functions  (HRTP)  of  three  subjects  was  reported.  Subjects 
were  tasted  in  both  the  median  and  horizontal  plane.  The  simulated 
localization,  in  the  horizontal  plane,,  produced  results  comparable 
to  real  sound  sources  when  the  subject's  own  HRTF  was  used.  When 
another  subject's  HRTF  waa  used,  localization  ability  decreased. 
Simulated  localization  was  more  accurate  in  the  horizontal  plane 
than  the  median  plane. 

67.  Housheglan,  G. ,  &  Jeff  rasa,  L.  A.  (1959).  Role  of  lnteraural  time  and 

intensity  differences  in  the  lateralization  of  low-frequency  tones 
(Report  No.  DRL-A-144).  Austin,  TXi  ThS  University  of  Texas.  "  f AD— 
A031-955). 

This  study  investigated  lnteraural  time  and  amplitude  differences 
(ITD  and  IAD,  respectively)  in  low  frequency  (500  -  1000  Hz)  tones. 
Using  the  method  of  adjustment,  a  subject  was  given  a  standard  tone, 
and  adjusted  the  ITD  of  a  noise  stimulus  until  it  appeared  to  occupy 
tha  same  intracranial  position  fs  the  tone*  The  results  indicated 
that  as  tha  stimulus  Intensity  increased,  the  amount  of  ITD 
necessary  to  match  the  tone  location  decreased.  This  effect  was 
discussed  in  terms  of  encoding  latency  and  stimulus  intensity. 
However,  differences  found  in  the  time-intensity  functions  for 
different  frequencies  and  subjects  Indicate  that  other  processes  are 
Involved. 

68.  Mudd,  8.  A.,  &  McCormick,  E.  J.  (1960).  The  use  of  auditory  cues  in  a 

visual  search  task.  Journal  of  Applied  Psychology.  44.  184-188. 

This  article  investigated  use  of  auditory  signals  to  reduce  visual 
search  time*  Three  parameters  of  the  auditory  signal  were  combined 
into  one-,  two-,  or  three-dimensional  cues  used  to  Identify  a  sector 
or  groups  of  eectora  in  which  e  deviant  dial  setting  was  located. 
The  results  Indicated  that  two-  end  three-dimensional  cues 
significantly  reduced  search  time  over  the  un  Id  linens  Iona  1  group  and 
the  no-cue  control  group.  A  significant  sector-by-group  interaction 
was  found.  Apparantly  the  no-cue  group  tended  to  search  the  array 
via  a  left  to  right,  top  to  bottom  strategy,  whereas  the  cued 
groups'  strategy  was  dependent  upon  the  auditory  cue. 
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69.  Mulligan,  R.  M. ,  &  Shaw,  M.  L.  (1981).  Attending  to  simple  auditory  and 

visual  signals.  Perception  &  Psychophysics,  30.  447-454. 

This  study  Investigated  the  changes  in  performance  when  attentlonal 
demands  were  switched  between  trials.  One  group  was  presented  with 
either  auditory  or  visual  information  constant  within  a  block,  while 
another  group  was  presented  with  either  auditory  or  visual 
Information  which  changed  from  trial  to  trial.  The  affects  of 
increased  probability  of  error  in  the  divided  attention  group ,  due 
to  set  else,  were  partiallad  out  of  the  analysis.  The  results 
indicated  no  decrement  in  performance  due  to  attentlonal  switching 
demands.  It  was  concluded  that  tha  difference  between  conditions 
was  due  to  non-attentional  (set  size)  effects. 

70.  Muslcant,  A.  D. ,  &  Butler,  R.  A.  (1984a).  The  influence  of  plnnaa-based 

spectral  cues  on  sound  localisation.  The  Journal  of  the  Acoustical 
Society  of  America.  75 .  1195-1200. 

Two  experiments  were  performed  to  investigate  the  role  of  the  pinna 
in  providing  spectral  cues  for  binaural  localisation.  In  experiment 
1,  noise  bursts  of  four  types  (broadband,  4  kHs  high  pass,  4  kHz  low 
pass,  and  1  kHz  low  pass)  were  presented  to  subjects  with  either 
both  pinnae  occluded  or  both  open.  The  results  showed  a  significant 
decrease  in  localization  error  with  open  pinnae  for  the  broadband 
and  4.0  kHz  high  pass  noise  stimuli.  Experiment  2  Investigated  the 
role  of  the  near  or  far  pinna  in  front-rear  localization  reversals. 
High  pass  noise  bursts  were  presented  to  subjects  with  either  both 
pinnae  occluded,  both  pinnae  open,  the  right  (far)  pinna  occluded, 
or  the  left  (near)  pinna  occluded.  In  tarma  of  front-rear  reversals 
in  localization,  the  results  suggested  that  the  near  pinna  is 
responsible  for  perception  of  the  quadrant  of  the  source.  The 
authors  concluded  that  s  theory  of  binaural  localization  must 
Incorporate  pinnae-based  cues  to  account  for  increased  accuracy  and 
encoding  of  front  or  rear  quadrant  location. 

71.  Musicant,  A.  D. ,  t»  Butler,  R.  A.  (1984b).  The  psychophysical  basis  of 

monaural  localization.  Hearing  Research.  14 ,  185-190. 

This  study  investigated  monaural  localization  of  wide  band  noise 
bursts,  at  various  center  frequencies,  in  the  horizontal  plane.  The 
apparent  location  of  the  stimuli  were  not  dependent  upon  actual 
position,  but  rather  was  based  on  frequency  composition.  As  the 
center  frequency  Increased,  the  apparent  location  of  tha  stimulus 
shifted  from  330  to  225  degrees.  Further,  it  was  found  that  the 
location  judgments  were  related  to  the  speaker  location  which  would 
have  resulted  In  the  greatest  pinna  amplification  of  critical 
frequencies.  This  relationship,  it  was  concluded,  is  based  upon  an 
Internal  spatial  referent  which  associates  the  location  of  the 
stimulus  to  the  specific  pinna  amplification  of  the  frequency  bands. 
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72.  Nickerson,  R.  S.  (1973).  Intersensory  facilitation  of  reaction  time: 

Energy  summation  or  preparation  enhancement.  Psychological  Review, 
80,  489-509. 

This  article  reviewed  studies  which  showed  that  the  addition  of 
secondary  information  (uaually  auditory)  would  facilitate  reaction 
time  to  the  primary  stimulus  (usually  visual).  The  addition  of  a 
second  stlmulue  improves  performance  in  responding  to  the  primary 
stimulus  even  when  it  is  irrelevant  to  the  task,  Nickerson 
evaluated  these  results  in  terms  of  whether  the  second  stimulus 
increased  physiological  energy  summation  or  was  an  alerting  cue  as 
to  the  primary  stimulus.  It  wee  concluded  that  a  preparation- 
enhancement  hypothesis  was  adequate  Ko  explain  the  findings. 

73.  Nordlund,  B«  (1962),  Physical  factors  in  angular  localisation.  Acta 

Otolaryngology,  54 .  75-93. 

i 

Time,  phase,  and  amplitude  differences  between  the  two  ears  of  a 
dummy  head  wera  reassured.  Aslmuth  end  frequency  of  tonal  stimuli 
were  varied.  Time  end  phase  differences  ware  found  to  be  a  direct 
function  of  aslmuth  between  0-60*  and  120-180*.  Amplitude,  on  the 
other  hand,  tended  to  be  an  irregular  function  of  aslmuth. 
Comparisons  wars  made  with  the  theoretically  beaed  predictions  of 
Hartley  and  Fry  (1921)  and  with  the  empirical  results  of  Faddersen, 
et.  el.  (1955),  Firestone  (1930),  Mills  (1958),  and  Slvlan  and  White 
(1933).  The  article  is  a  comprehensive  and  conslse  reference  for 
binaural  time,  phase  and  amplitude  differences. 

74.  Oetmen,  L.  C.  (1975).  Simultaneous  processing  of  bleeneory  information 

Aberdeen,  MD:  U.  S.  Array' Human  Engineering  Laboratory.  (AD-A012 
149) 

The  effects  of  bleeneory  processing  of  Information  were  reviewed 
vitn  raipect  to  redundancy  of  information,  vigilance,  information 
rate,  end  detection  difficulty.  Bleeneory  facilitation  wet  found  to 
occur  whan  the  information  was  redundant,  but  only  when  the  stimuli 
were  difficult  to  detect,  recognize,  or  discriminate.  Performance 
improves  in  blaensory  tasks  only  if  the  information  Is  redundant. 
It  was  concluded  that  non-redundant  information  In  a  blsensory  task 
could,  in  high  workload  situations,  unnecessarily  increase  response 
demands . 

75.  Parrott,  D.  R.  (1982).  Studies  in  the  perception  of  auditory  motion. 

In  R.  W.  Gatehouae  (Ed.),  Localization  of  Soundt  Theory  and 
Application,  (pp.  169-193).  Groton,  CT:  Amphora  Press. 

Studies  on  the  perception  of  illusory  auditory  motion  were 
reviewed.  Using  apparent  movement  in  the  visual  system  as  a  model, 
simillar  auditory  movement  effects  (e.g.  autokinesis,  Induced 
motion,  motion  aftereffects,  auditory  blur,  and  velocity  judgements) 
wera  identified.  Auditory  movement  paradigms  have  produced 
perceived  movement  with  either  a  stationary  source  and  listener,  a 
moving  source  with  a  stationary  listener,  a  stationary  source  with  a 
moving  listener,  or  a  moving  source  and  listener.  In  dichotlc 
listening,  the  perception  of  movement  was  found  to  bo  a  direct 
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function  of  Interstimulus  onset  interval  (ISOI)  and  signal 
duration.  The  optimal  ISOI  for  perception  of  continuous  auditory 
motion  Increased  as  signal  duration  increased.  The  autokinetic 
affect  occurs  in  both  the  vertical  and  horizontal  plane,  and  its 
duration  was  found  to  be  a  decreasing  function  of  increases  in 
signal  bandwidth.  Motion  aftereffect  creates  the  perception  of 
movement  through  the  use  of  a  stationary  listener  and  source  with  a 
moving  auditory  background.  After  prolonged  exposures,  the  auditory 
source  is  perceived  as  moving  in  the  direction  opposite  the 

background.  The  paper  also  reviewed  the  perception  of  auditory 
velocity.  In  general,  listeners  were  very  consistent  in  their 

velocity  Judgements  (power  function  slope  "1.0). 

76.  Parrott,  D.  R. ,  &  Elfner,  L.  P.  (1968).  Monaural  localization.  The 

Journal  of  Auditory  Research.  ^  185-193. 

Monaural  localisation  was  compared  in  two  conditions:  one  in  which 
loudness  dlffarences  betwesn  the  two  speakers  were  equated,  and  one 
in  which  they  were  not.  Performance  was  also  compared  to  a  binaural 
condition.  The  results  Indicated  that  while  binaural  performance 

was  nssrly  errorless,  monaural  performance  with  speakers  matched  for 
loudness  was  at  chance,  and  that  monaural  performance  with  unequal 
loudneea  levels  fall  in  between*  A  second  experiment  was  conducted 
which  provided  training  for  tha  subjects,  and  added  a  second 
condition  which  reversed  loudness  (i.a.,  with  tha  left  ear  occluded 
the  left  speaker's  intensity  was  greater  than  tha  right  speaker). 
The  results  supported  the  hypothesis  that  an  intensity  cue  was  being 
used.  In  the  reversed  loudness  condition  subjects  consistently 
selected  the  speaker  with  the  greater  intensity.  No  support  was 
found  for  pinna  based  cues  being  utilised  in  either  experiment. 
When  loudness  cues  were  held  constant  performance  was  at  chance. 

77.  Perrott,  D.  R. ,  &  Musicant,  A.  D.  (1981).  Dynamic  minimus  audible 

angle:  Binaural  spatial  acuity  with  moving  sound  sources.  The 

Journal  of  Auditory  Research.  21.  287-295. 

The  Minimum  Audible  Angle  (MAA),  originally  developed  by  Mills 
(1958),  was  expanded  to  encompass  moving  sound  source  stimuli.  The 
method  of  adjustment  was  used,  whereby  the  subject  could  adjust  the 
initiation  of  a  signal  until  it  was  perceived  to  be  at  0  degrees 
azimuth.  Velocity  was  varied  over  four  conditions:  45,  60,  120, 
and  240  degrees/sac.  The  results  showed  that  dynamic  MAA  was 
comparable  to  the  static  MAA  dsvaloped  by  Mills  sxcspt  at  tha 
highest  vsloclty.  It  was  concludsd  that  the  minimum  angular 
difference  threshold  of  moving  sources  is  similar  to  the  MAA  derived 
under  static  conditions. 

78.  Pick,  H.  L. ,  Warren,  D.  H. ,  &  Hsy,  J.  L.  (1969).  Sensory  conflict  in 

Judgements  of  spatial  direction.  Perception  and  Psychophysics.  6, 
203-20  5. 

Using  an  artifical  discrepancy  paradigm,  the  dominance  of  one 
modality  over  another  was  evaluated.  The  modalities  investigated 
were  vision,  audition, and  proprioception.  The  results  indicated 
visual  Information  biased  audition  and  proprioceptive  judgments,  end 
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proprioceptive  information  biased  audition  judgments.  It  was 
concluded  that  combinations  of  sense  modalities  do  not  behave  as  an 
integrated  system;  rather,  in  certain  circumstances,  some  modalities 
Influence  responding  more  chan  others. 

79.  Platt*  B.  B. ,  &  Warren,  D*  H.  (1972).  Auditory  localisations  The 
importance  of  eye  movements  and  a  textured  visual  environment. 
Perception  and  Psychophysics.  12 .  245-248. 

Two  txpar intents  were  conducted  which  Investigated  the  role  of  eye 
movements*  in  either  light  or  derk  conditions*  and  eye-hand 
coordination  In  an  auditory  localisation  taak.  Experiment  1  showed 
thet  eye  movement*  under  lighted  conditions  wars  more  accurate  in 
locating  an  auditory  signal  than  eye-fixation  anu  no  vision 
conditions*  Experlmant  2  investigated  the  role  of  eye  movements  and 
pointing  in  lightad,  derk*  and  fixation  conditions*  Saccadic  eye 
movements  were  more  accurate  in  eh*  lighted  eye  movement  condition 
then  the  derk  eye  movement  condition*  In  addition,  the 
correspondence  between  eye  end  hand  position  was  greater  in  the 
lighted  eye  movement  condition*  It  use  concluded  that  target- 
directed  eye  movements  produced  the  moat  accurate  localisation  due 
to  the  availability  of  e  textured  environment  to  supply  information 
about  eye  position* 

80*  Pollack*  1*,  &  Rose*  M*  (1967).  Effect  of  head  movement  on  the 
localisation  sounds  in  the  equatorial  plana.  Perception  end 
Pevchophyeics.  2.  591-596. 

This  study,  in  five  experiments*  Investigated  the  conditions  under 
which  heed  movement  improved  localisation.  In  general ,  the 
conditions  ere  (1)  austeined  sound  sources,  and  (2)  sound  source 
locations  at  large  angles  outside  the  median  plena.  They  conclude 
thet  heed  movement  facilitates  localisation  by  allowing  the  listener 
to  take  advantage  of  the  higher-acuity  region  around  0*  asimuth. 

81.  Rodgars,  C.  A.  (1981).  Multidimensional  localisation:  An  Investigation 

of  poaalbla  pinnae  cues.  Dissertation  Abstracts  International.  42, 
181  IB. 

Measuramenta  of  head  related  transfer  functions  (HRTF) ,  at  24 
horlsontal  positions,  were  made  for  three  subjects,  and  were 
compared  in  both  the  time  and  frequency  domains.  The  HRTF a  differed 
between  subjects*  but  showed  simlliar  trends.  For  two  of  the  three 
eubjects*  the  HRTFs  tended  to  systematically  vary  as  a  function  of 
source  position*  Possible  cues  produced  by  the  pinna 

transformations  are  diecueeed. 

82.  Roffler,  S.  K. ,  &  Butler*  R.  A.  (1968).  Localisation  of  tonal  stimuli 

in  the  vertical  plan*.  The  Journal  of  the  Acoustical  Society  of 
America.  43,  1260-1266. 

A  series  of  experiments  investigated  the  role  of  frequency  and  body 
orientation  in  localisation  of  tonal  stimuli  in  the  vertical 
plana.  It  waa  found  that  localization  in  the  vertical  plane  was  a 
function  of  frequency.  Higher  frequencies  tended  to  be  judged  as 
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originating  higher  in  the  vertical  plane;  lower  frequencies  were 
judged  aa  originating  lower  in  the  plane*  Several  additional 
experiments  investigated  the  role  of  vision  and  body  orientation  in 
making  these  judgements*  Manipulations  included  changing  subject 
orientation  with  regard  to  the  vertical  plane,  decreasing  visual 
angle  through  Increases  in  distance  from  the  plane,  using 
congentlally  blind  subjects,  and  using  children  without  knowledge  of 
high  and  low  descriptions  of  pitch.  In  general,  the  phenomenon 
remained  consistent* 

83.  Russell,  G.  (1976).  The  role  of  the  pinna  in  monaural  horizontal  plane 

localization.  Journal  of  Auditory  Research.  16,  68-70. 

Binaural  localisation  was  compared  to  monaural  localisation  with  and 
without  pinna  cues  (open  or  occluded  pinna).  Performance  was  poorer 
in  monaural  conditions  than  binaural  conditions.  Monaural  localisa¬ 
tion  was  worse  with  the  pinna  occluded  than  with  the  pinna  open. 

84.  Sandal,  T.  T. ,  Teas,  D.  C. ,  Peddarsen,  W.  B. ,  &  Jef frees,  L*  A. 

(1935).  Localization  of  sound  from  single  and  paired  sourcea.  The 
Journal  of  the  Acoustical  3oclety  of  America.  27,  842-832. 

The  role  of  lntaraural  tine  differences  (ITDs)  in  auditory 
localisation  was  investigated.  Tones  ranging  from  300  to  3000  Hz, 
and  a  wlde-band  noise,  were  usad  as  stimuli.  The  task  consisted  of 
localising  a  sound,  presented  by  either  one  or  two  speakers,  by 
adjusting  the  location  of  the  noise  source  until  the  noise  and  tone 
ware  perceived  as  occurlng  at  the  same  location.  Thraa  experiments 
were  performed.  The  first,  using  a  single  source,  evaluated  the 
accuracy  of  the  tonal-noise  matching  task.  Experiments  2  and  3  usad 
two  speakers,  and  differed  as  to  whether  the  tonal  stimuli  were 
presented  in  phase  (Experiment  2)  or  out  of  phase  (Experiment  3). 
1TD  measurements  were  used  as  the  basis  for  predictions  of 
localization.  The  results  indicated  that  the  tonal  matching  method 
produced  accuracy  above  that  found  in  pointing  mathoda  (l.e«, 
lntermodal  matching),  and  that  the  predictlone  of  location,  based 
upon  1TD  vector  analysis,  were  consistent  with  the  data  up  to  1500 
Hz.  It  was  concluded  that  ITD  was  the  major  cue  utilized  in  low 
frequency  localization. 

85.  Searle,  C.  L.  (1982).  A  model  of  auditory  localization;  Peripheral 

constraints.  In  R.  W.  Gatehouse  (Ed.),  Localization  of  Sound; 
Theory  and  Applications  (pp.  42-50).  Groton,  CT;  Amphora  Press. 

A  block  diagram  model  of  the  physical  and  neural  processes  involved 
In  auditory  localization  was  presented.  The  decision  mechanism  was 
hypothesized  to  be  based  upon  a  decision  theory  metric,  whereby  the 
reliability  estimates  of  the  various  location  cues  are  combined  via 
a  weighted  average  to  identify  the  sound  source.  The  model 
Identifies  g;  pt  in  the  physiological  knowledge  base  that  apply  to 
Che  methods  Involved  In  localization. 
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36.  Sear  La,  C.  L. ,  Braids,  L.  D. ,  Cuddy,  D.  R. ,  &  Davit,  M.  F.  (1975). 

Binaural  pinna  dlaparlty:  Another  auditory  localization  cue.  The 

Journal  of  the  Acoustical  Society  of  America,  J7,  448-455. 

This  study  measured  pinna  transformations  In  tha  laft  and  right  ear, 

;  fo r  various  frequencies  and  altvatlons  in  tha  median  plane.  The 

results  shoved  significant  differences  between  the  transform 
functions  for  tha  two  ears.  Thaaa  differences  vara  assumed  to  be 
due  to  pinna  asymaatry.  Tha  pinna  transform  disparities  vara 
evaluated  as  possible  cues  to  aid  auditory  localization.  It  vas 
concluded  that  these  disparities  are  detectable  and  are  used  to  aid 
localization  In  tha  median  (and  poaalbly  tha  horisontal)  plane. 

I  '  - 

37*  Baarle,  C.  L.,  Bra  Id  a,  L»  D. ,  Davis,  M.  P* ,  &  Colburn,  H.  S.  (1976). 

Model  for  auditory  localisation.  The  Journal  of  tha  Acoustical 
ioclety  of  America.  6&  1164-1173. 

A  mathematical  modal  of  auditory  localization  vaa  developed  and 
validated  against  experimental  findings.  The  modal  It  based  on  the 
theory  of  signal  dstsctability.  Six  cues  for  auditory  localization 
mere  identified,  Including  lntereurel  time  and  amplitude  differences 
(ITD  end  ZAD,  respectively),  monaural  heed  ehadov,  binaural  pinna 
amplification  reaponaa,  monaural  pinna  amplification  response,  and 
•boulder  bounce.  ZTD,  ZAD,  and  monaural  head  ehadov  provided  cuai 
In  the  horisontal  plana,  vhile  the  binaural  pinna  reaponaa,  monaural 
pinna  response,  and  shoulder  bounce  provided  cute  In  both  the 
horisontal  end  vertical  plane.  The  binaural  cues  vara  assumed  to  be 
absolute  cues  to  localisation,  in  that  no  a  priori  information  vaa 
required,  while  tha  monaural  cue*  were  aaeumed  to  be  relative  cues 
to  location.  The  localisation  modal,  for  both  tha  horisontal  and 
vertical  plans,  assumed  that  head  movemente  vara  restrained,  the 
cues  vara  independent,  scelar,  gausslan  random  variables,  and  that 
tha  subject' a  reaponaa  criterion  wee  unbiased.  The  decision  vector, 
reduced  to  a  scalar  quantity  due  to  tha  above  assumptions, 
determined  source  location  based  upon  the  weighted  average  of  the 
six  cuee.  The  decision  judgement  le  based  upon  the  observer' a 
estimate  of  the  decision  scaler.  Tha  model  vas  validated  by 
comparing  tha  model's  predictions  of  standard  error  of  location 
against  tha  average  error  magnitude  of  the  empirical  results.  A 
Chl-equare  goodnaea  of  fit  tast  revealed  that  tha  modal  adequately 
fit  the  teat  date.  Tha  model  dose  not  account  for  localization  with 
head  movemente,  which,  if  Included,  should  improve  the  estimates  of 
localisation  accuracy. 

88.  Shaw,  B.  A.  Q.  (1974a).  Tha  axtarnal  ear.  In  W.  D.  Keidel  &  W.  D.  Neff 
(Ida.),  Handbook  of  Sensory  Psychology,  VoL,  5  (pp.  455-490).  New 

Yorkt  Springer-Verlag. 

This  article  reviewed  tha  acoustical  properties  of  the  pinna,  ear 
canal,  head,  and  torso.  Data  from  atvtral  sources  ware  combined  to 
yield  characteristic  information  on  tha  pinna's  sound  pressure 
changes  and  lntaraural  time  and  phase  differences.  Anthropometric 
data  on  ear  canal,  pinna  flange,  and  concha  In  human  subjects,  aa 
wall  aa  other  mammals'  ear  characteristics,  were  given.  A  review  of 
physiological  noiaa  in  tha  auditory  system  was  also  undertaken.  In 
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general,  the  paper  gives  a  comprehensive  synthesis  on  the  effects  of 
the  external  ear  on  Incoming  sound  sources* 


89.  Shaw,  E.  A.  G.  (  1974  b).  Trsnformation  of  sound  pressure  level  from  the 

far  field  to  the  eardrum  in  the  horisontal  plans.  The  Journal  of 
the  Acoustical  Society  of  America.  56 .  1848-1861. 

Twelve  studies  which  measured  pinna  transformations  were  compared. 
The  procedures  employed  to  collapse  data  over  studies  ware 
discussed,  and  families  of  curves  were  presented.  Differences 
between  studies  and  irregularities  in  the  curves  were  identified. 

90.  Shaw,  E.  A.  G.  (1982a).  External  ear  response  and  sound  localisation. 

In  R.  W.  Gatehouse  (Ed.),  Localisation  of  Sound i  Theory  and 

Application  (pp.  30-41).  Groton,  CTi  Amphora  Press. 

Research  related  to  the  role  of  the  pinna  in  sound  localisation  was 
reviewed.  Specifically,  the  means  of  spectral  encoding  (i.e. , 
filtering  models),  the  role  of  pinna  disparity  cues,  and 

Intersubject  dlffarencaa  were  identified  as  unrosolved  problems. 
Compsrlsons  were  also  made  between  human  and  KEMAR  manikin  eardrum 
response  curves. 

91.  Shaw,  E.  A.  G.  (1982b).  1979  Rayleigh  medal  lscturei  The  elusive 

connection.  In  R.  W.  Gatehouse  (Ed.),  Local last  ion  of  Sound!  Theory 
and  Applications  (pp.  13-29).  Groton,  CTt  Amphora  Press.  - 

The  work  on  pinna-based  transformations  of  auditory  stimuli  was 
reviewed,  and  the  analysis  was  extended  to  include  how  those 

transformations  are  modified  by  the  characteristics  of  tha  middle 

ear.  It  was  found  that  the  tympanic  membrane  and  ossicles  tend  to 

reduce  the  energy  availablw  at  the  oval  window.  The  inner  ear 

receives  only  10  j  of  the  energy  available  /rom  the  sound  field. 

92.  Shaw,  E.  A.  G. ,  &  Teranlshi,  R.  (1968).  Sound  pressure  generated  in  an 

external  ear  raplica  and  real  human  ears  by  a  nearby  point  source. 
The  Journal  of  the  Acoustical  Society  of  Amorica.  44.  240-349. 

Sound  pressure  changes  were  measured  at  various  azimuths  using  a 

rubber  model  of  the  pinna,  concha,  and  ear  canal.  Measurements  of 
sound  pressure  were  made  for  both  open  canal  and  blocked  ear  canal 
conditions.  The  results  were  identical  for  both  conditions  up  to  12 
kHz.  Five  acoustic  pressure  gain  maxima  are  shown  which  vary  in 
frequency  as  a  function  of  azimuth.  Comparisons  were  also  made 
using  real  ears,  and  were  in  good  agreement.  The  paper  provides 
comprehensive  data  on  pinna  effects  on  pressure  over  the  entire 

range  of  audible  frequencies. 


93.  Shelton,  B.  R. ,  Rodger,  J.  C. ,  &  Searle,  C.  C.  (1982).  The  relation 

between  vision,  head  motion  and  accuracy  of  free-fleld  auditory 
localization.  The  Journal  of  Auditory  Research.  22 ,  1-7. 

The  role  of  vision  end  head  motion  In  facilitating  auditory 
localisation  was  examined  by  restricting  vision,  head  movement,  or  a 
combination  of  the  two.  Subjects'  ability,  to  accurately  localise 
filtered  noise,  in  the  left  front  quadrant,  was  found  to  be  subject 
to  an  interaction  between  head  motion  and  visions  head  motion  only 
improves  performance  if  normal  vision  is  available.  It  was 
concluded  that  the  major  component  in  facilitating  auditory 
localisation  is  visually  fixating  the  apparent  locus  of  the  auditory 
stimulus. 

94.  Shelton,  B.  R.,  &  Seer Is,  C.  1.  (1980).  The  influence  of  vision  on  the 

absolute  identification  of  sound  source  position.  Psrcsptlon  and 

Psychophysics.  28.  389-398.  « 

Two  experiments  wars  designed  to  investigate  the  Influence  of  vision 
on  auditory  localisation.  Experiment  1  varied  speaker  orientation 
(front,  elds,  beck,  end  vertical  positions)  and  the  availability  of 
vision.  The  effects  of  vision  improved  performance  in  ell 
condition*  except  vertical.  Even  whan  the  speaker  positions  wars 
out  of  sight  (back),  the  availability  of  vision  improved 
performance.  It  wme  suggested  that  vision,  in  this  condition, 
provided  e  perceptual  anchor  or  frame  of  reference.  Experiment  2 
investigated  the  role  of  several  factors  in  visual  facilitation  of 
auditory  localisation*  Tha  rasults  suggs scad  that  vision 
facllitatss  performancs  whan  visual  and  auditory  information  is 
correlated. 

93.  Sivian,  L.  J. ,  &  Whits,  S.  D.  (1933).  On  minimum  audible  sound 
fields.  The  Journal  of  the  Acoustical  Society  of  America,  5,  288- 
321. 

This  study  measursd  tha  minimum  audible  field  (MAP),  both  monaurally 
end  blnaurally,  over  a  frequency  ’•eng#  of  100  to  13000  He,  and  ov«r 
various  eslmuths.  The  MAP  threshold  rawasure  Is  obtained  relative  to 
free  field  measurements,  and  indicates  pressure  thresholds  at  the 
observer's  head.  In  contrast,  tha  minimum  audible  pressure  (MAP)  is 
obtained  in  terras  of  pressure  thresholds  at  the  observer' a 
eardrum.  In  general,  MAP  values  relate  to  the  usual  mode  of 
hearing,  while  MAP  values  rslsts  to  ear  mechanics  (e.g.,  eardrum 
displacement).  A  U-shaped  function  of  MAP  curves  was  found  in  both 
monaural  and  binaural  conditions  at  0  degress  azimuth.  The  pressure 
necessary  for  detection  inersssad  up  to  approximately  4000  Hz,  and 
then  decreased.  Pressure  decreased  as  azimuth  Increased  and  reached 
a  maximum  whan  the  open  ear  faced  the  speaker.  The  pressure 
decreases  tended  to  get  larger  as  frequency  increased.  Comparisons 
of  MAP  and  MAP  curves  identified  lower  thresholds,  across  the 
frequency  range,  for  MAP  values  relative  to  MAF  values.  Several 
problems  in  obtaining  MAP  values  were  discussed,  mainly  concerned 
with  inaufficiant  1930' a  technology. 
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96.  Stewart,  G.  W.  (1922).  The  intensity  logarithmic  lav  and  the  difference 

of  phase  effects  in  binaural  audition.  Psychological  Monographs. 
}\j  30-44. 

This  article  evaluated  the  constant  k,  computed  for  each  subject,  in 
the  intensity  logarithmic  lavs  0  -  klogt  Ip/I^.  The  lav  relates 
the  angular  displacement  of  a  sound  source  (w)  to  the  Intensity  of 
the  source  at  the  right  and  left  ears  (I.  and  1^,  respectively). 
The  constant  k  vts  found  to  very  considerably  betvaen  subjects,  and 
within  an  individual,  k  decreased  as  frequency  increased.  The 
perceived  angular  displacement  of  a  sound  source  was  also  measured 
as  phase  relationships  between  the  two  tones  and  the  ear  were 
varied.  There  tended  to  be  an  upper  frequency  limit  of  1260  He 
through  which  the  alteration  of  phase  produced  a  fused  auditory 
Image.  These  effects  have  bean  well  documented  ae  localisation 
cues,  and  now  are  referred  to  as  interaural  amplitude  and  time  or 
phase  difference  cues* 

97.  Thurlow,  W.  R. ,  Mangels,  J.  W. ,  &  Rungs,  D,  S.  (1967).  Head  movements 

during  sound  localisation.  The  Journal  of  the  Acoustical  Society  of 
America.  42 ,  489-493. 

The  Thurlow  and  Runge  (1967)  study  of  induced  head  motion  was 
extended  in  this  paper  through  the  evaluation  of  subject-induced 
head  movement.  Subjects  were  presented  with  noise  signals,  in 
either  the  horlsontal  or  vertical  plana,  and  their  head  movements 
used  to  localize  the  sound  were  filmed.  It  was  predicted  that 
rotational  and  pivot  movements  would  produce  the  greatest  change  in 
interaural  time  and  intensity  cues,  and  would,  therefore,  be  used 
most  often.  In  terms  of  frequency,  subjects  tended  to  employ 
rotational,  rotatlonal-tlp-and-plvot,  and  rotational-pivot  movements 
most  often.  Rotational  movements  are  larger  (in  degrees)  than  tip 
on  pivot  movements.  Subjects  also  tended  to  rotate  and  tip  their 
heads  towards  the  sound  source,  but  usually  not  enough  to  directly 
face  the  energized  speaker.  It  was  concluded  that  most,  but  not 
all,  subjects  will  naturally  move  their  heads  to  aid  in  localizing 
sound  sources. 

98.  Thurlow,  W.  R.  ,  &  Runge,  P.  S.  (1957).  Effect  of  Induced  haad  movements 

on  localization  of  direction  of  sounds.  The  Journal  of  the 
Acoustical  Society  of  America.  42  ,  480-488. 

Three  experiments  investigated  the  effects  of  induced  heed  motion  in 
the  horlsontal  and  vertical  planes.  Five  types  of  movement  were 
evaluated:  no  movement,  rotation,  pivot,  tip,  rotate-pivot  and  free 
head  movement.  Head  movement  was  directly  controlled  by  the 
experimenters.  The  results  indicated  a  reduction  in  front-back 
reversals  and  in  horizontal  localization  accuracy  with  movement. 
Vertical  location  accuracy  improved  only  slightly  with  head 
movement.  Rotational  movement  provided  greater  accuracy  than  pivot 
and  tip  movements.  It  was  concluded  that  Wallach's  (1940) 

theoretical  discussion  on  the  importance  of  head  movements  waa 
empirically  supported. 


211 


99 


Wallach,  H.  (1940).  The  role  of  head  movements  and  vestibular  and 
visual  cues  in  sound  localization.  Journal  of  Experimental 
Psychology.  2]_,  339-368. 

The  effects  of  heed  movement  (either  actual  or  perceived)  on 
auditory  localization  was  demonstrated.  In  one  experiment  the 
movement  of  a  subject's  head  controlled  the  sound  source  such  that 
It  swung  through  an  angla  twica  tha  alee  of  the  head  movement.  The 
perception  of  sound  source  location  was  approximately  a  mirror  image 
(180*  reversed)  from  the  actual  location*  Only  when  the  source 
remained  constant  did  tha  subject  accurately  localize  the  source. 
Passive  end  apparent  movement  produced  identical  results. 

100.  Warren,  D.  H.  (1970).  Intermodallty  interactions  In  spatial 

localization.  Cognitive  Psychology.!.  114-133. 

A  series  of  experiments  Investigated  the  role  of  vision  and  eye 
movements  in  auditory  localization.  In  general,  the  results 
indlcsted  that  visual  input  Improvad  auditory  localisation  only  when 
a  atructurad  visual  anvironrnant  was  available.  In  the  dark,  or  with 
eyes  closed,  auditory  localization  was  poorer  then  with  eyes  open. 
The  availability  of  a  visual  anvironrnant  apparently  allows  tha 
listener  to  map  the  auditory  stimulus  onto  s  viaual  representation 
of  the  environment. 

101.  Watkins,  A.  J.  (1978).  Paychoacoustlcal  aspects  of  synthesized  vertical 

locale  cues.  The  Journal  of  the  Acoustical  Society  of  America.  63. 
1132-1165. 

Pinna  transformations  were  simulated  through  a  two  variable  latency 
delay  and  add  process.  Using  Betteeu'e  (1967,  1968)  data  as 

paramstars,  tha  computar  simulation  waa  abla  to  product  apparent 
movamant  in  the  vertical  plane  similar  to  pravloue  findings.  A 
quantitative  model  of  location  decoding  wee  proposed.  The  model  Is 
baead  upon  spectral-pattern  recognition  (l.a.,  autocorrelation). 
Tha  predictions  of  the  model  were  coneletant  with  the  apparent 
elevation  of  the  stimuli. 

102.  Watkins,  A.  J.  (1982).  The  monaural  perception  of  ezimuthi  A  synthesis 

approach.  In  R.  W.  Gatehouse  (Ed.),  Localization  of  Sound;  Theory 
and  Application  (pp.  194-206).  Groton,  C*r»  Amphora  Press. 

This  article  expanded  upon  Batteau's  (1967)  work  on  the  directional 
band  theory  of  eound  localization.  The  theory  proposes  that  the 
pinna  produce  two  distinct  echoes  which  vary  In  latency  and  are  used 
aa  cuaa  to  define  the  sound  source  location.  Three  experiments  were 
performed,  using  monaural  cuts  presented  through  a  single  earphone. 
The  reeults  showed  that  subjects  psreeive  changes  Ln  azimuth 
dependent  upon  the  interaction  of  the  two  time  parameters  associated 
with  the  two  echoes.  Tht  results  supported  the  predictions  made  by 
dirscclonal  band  thsory. 
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103.  Weiner,  F.  M.  (1947).  On  Che  diffraction  of  a  progressive  sound  wave  by 


the  human  head.  The  Journal  of  the  Acoustical  Society  of  America, 
J£,  143-146. 

An  earlier  paper  by  Wiener  and  Rosa  (1946)  investigated  changes  in 
sound  pressure  from  the  entrance  to  the  ear  canal  to  the  eardrum. 
In  general,  sound  pressure  increaeed,  especially  at  higher 
frequencies,  at  the  eardrun  relative  to  the  ear  canal.  This  data 
was  then  combined  with  data  from  free  field  end  eer  canal 
maasuraments,  to  obtain  a  measure  of  pressure  changes  due  to  both 
head  diffraction  and  ear  canal  factors  at  the  eardrum*  Measurements 
wars  taken  over  various  frequencies  and  aslmuthal  positions.  It  was 
concluded  that  pressure  changes  at  the  eardrum  are  sufficient, 
especially  at  higher  frequencies,  to  encode  aslmuthal  changee. 

104.  Weinrich,  L.  (1982).  The  problem  of  front-back  localisation  in  binaural 

hearing.  In  0.  J.  Pedersen  and  T.  Poulsen  (Eds.),  Binaural  Effects 
in  Normal  and  Impaired  Hearing  (Scandinavian  Audiological 
Supplement,  No.  IS,  pp.  135-l45^. 

Intensity  measurements  of  pinna  transfer  functions  were  msde.  It 
wae  concluded  chat  changes  in  the  functions  were  available  as  e  cue, 
without  head  movements,  to  reduce  directional  inversions.  Using 
these  transfer  functions,  it  was  found  that  subjects  were  able, 
whether  or  not  the  signal  characteristics  wire  modified,  to  localise 
the  sound  source  without  front-back  inversions.  These 

modifications,  in  the  500  -  7000  H*  range,  consisted  of  manipulating 
the  tonal  characteristics  of  the  noise  bursts  such  that  the  transfer 
functions  were  taken  from  one  orientation  (t.e.,  speaker  location) 
and  presented  at  another  location. 

105.  Welsh,  R.  B. ,  &  Warren,  D.  H.  (1980).  Immediate  perceptual  response  to 

interseneory  diacrepency.  Psychological  Bulletin.  88,  638-667. 

A  review  of  the  interseneory  bias  literature  was  performed  with 
regards  to  sensory  modality,  the  type  of  discrepancy  (either 

perceptual  or  cognitive) ,  and  response  variables.  The  previous 
theories  are  unable  to  account  for  all  the  results  of  interseneory 
bias  and  perceptual.  adaptation.  A  modal  is  proposed  to  account  for 
the  effects  of  stimulus  situations,  modality  characteristics, 
cognitive  factors,  and  response  outcomes.  The  model  is  broad  enough 
to  cover  the  known  occurrences  of  Intaraansory  bias. 

106.  Wilbanks,  W.  A.  (1983).  Masking  of  the  signal  for  lateralization  of 

tones.  Bulletin  of  the  Psychonomic  Society,  2 1 ,  138-140. 

This  study  evaluated  the  change  in  interaural  phase  necessary  to 

detect  lateralization  of  tones  under  conditions  of  different  source 
frequency  and  interaural  correlations  of  background  wido  band  noise. 
Using  Zwislocki  and  Feldman's  (1956)  data  aa  a  baselina  for  no  noiae 
(*2°  phase  difference),  the  results  showsd  that  as  the  tonal 
frequency  Increased  phase  adjustments  increased,  and  as  the 
interaural  noise  correlation  decreased  phase  adjustments  increased. 
With  noise  correlation  equaling  +1.0  for  a  500  Hz  tone,  the  phase 
adjustment  Increased  from  Zwislocki  and  Feldman's  no  noise  condition 


to  8°.  It  la  concluded  that  both  the  frequency  and  Interaural  noise 
correlations  are  Important  variables  In  attempting  to  laterallze  a 
tonal  signal  imbedded  in  background  noise. 

107.  Wright,  D. ,  Hebrenk,  J.  H.  &  Wilson,  B.  (1974).  Pinna  reflections  -o 

cue*  for  localization.  The  Journal  of  the  Acoustical  Socle’-/  of 
America.  56.  957-962. 

Thia  study  Investigated  the  monaural  difference  threshold  of  tine 
delays.  Batteau  (1967)  proposed  Chat  pinna  reflections  Introduced 
time  delays  of  10  to  100  microseconds,  which  were  dependent  upon 
sound  source  direction.  It  was  found  that  delay  times  as  little  as 
20  microseconds  were  recognizable  when  the  amplitude  ratio  of  the 
two  signals  were  greater  than  .67,  and  that  JND  values  were 
consistent  with  previously  obtained  minimum  audible  angles  for 
monaural  localisation.  These  findings  support  Betteau's  theory. 

108.  Zwlslockl,  J, ,  &  Feldman,  R.  S.  (1956).  Just  noticeable  differences  In 

dlchotlc  phase.  The  Journal  of  the  teoustical  Society  of  America. 
28,  860-864. 

Phase  differences,  in  a  dichotic  listening  task,  ware  directly 
manipulated  to  assess  the  relationship  between  changes  in  phase  end 
the  more  commonly  obtained  changes  in  times.  Just  noticeable 
differences  (JNDe)  were  obtained  from  three  subjects,  using  a  forced 
choice  paired  comparison  method  in  which  the  Intracranial  location 
of  a  standard  wee  judged  relative  to  a  test  stimulus.  Both 
frequency  end  intensity  were  varied.  The  results  indicated  that 
phase  discrimination  between  the  two  stimuli  wms  poorest  at  both 
high  and  low  intensities,  end  that  as  frequency  increased  the  JND 
for  phase  difference  increased.  Interaural  phase  differences  on  the 
order  of  2s  were  perceived  ee  e  shift  tn  the  location  of  the  test 
stimulus.  These  results  were  consistent  with  other  findings,  and 
supported  the  notion  that  phase  differences  in  localization  are 
effective  only  at  the  lower  frequencies. 
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The  numbers  under  each  topic  refer  to  the  annotation  number  In  the 
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